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Abstract 



The detectability of moons of extra-solar planets is investigated, focussing 
on the time-of-arrival perturbation technique, a method for detecting moons 
of pulsar planets, and the photometric transit timing technique, a method 
for detecting moons of transiting planets. Realistic thresholds are derived 
and analysed in the in the context of the types of moons that are likely to 
form and be orbitally stable for the lifetime of the system. 

For the case of the time-of-arrival perturbation technique, the analysis 
is conducted in two stages. First, a preliminary investigation is conducted 
assuming that planet and moon's orbit are circular and coplanar. This 
analysis is then applied to the case of the pulsar planet PSR B1620-26 b, 
and used to conclude that a stable moon orbiting this pulsar planet could 
be detected, if its mass was > 5% of its planet's mass (2.5 Mj up ), and if the 
planet-moon distance was ~ 2% of the planet-pulsar separation (23 AU). 
Time-of-arrival expressions are then derived for mutually inclined as well as 
non-circular orbits. 

For the case of the photometric transit timing technique, a different ap- 
proach is adopted. First, analytic expressions for the timing perturbation 
due to the moon are derived for the case where the orbit of the moon is 
circular and coplanar with that of the planet and where the planet's or- 
bit is circular and aligned to the line-of-sight, circular and inclined with 
respect to the line-of-sight or eccentric and aligned to the line-of-sight. It 
is found that when the velocity of the moon is small with respect to the 
velocity at which the planet-moon barycenter transits the star, that the 
timing perturbation could be well approximated by a sinusoid. Second, the 
timing noise is investigated analytically, for the case of white photometric 
noise, and numerically, using SOHO lightcurves, for the case of realistic 
and filtered realistic photometric noise. It is found the timing noise is nor- 
mally distributed and uncorrelated for planets likely to host large moons. 
In addition it is found that realistic stellar photometric noise results in a 
dramatic increase in the standard deviation of the timing noise, which is 



xvii 



not entirely reversed by filtering. Finally, using the method of generalised 
likelihood ratio testing, the work on the form of the timing perturbation due 
to a moon, and the behaviour of the timing noise are combined to derive 
both approximate analytic, and exact numerical thresholds. In particular, a 
Monte Carlo simulation is run which investigates thresholds for the cases of 
aligned, inclined and eccentric planet orbits for white, filtered and realistic 
photometric noise for a range of planet masses (10Mj up , lMj up , lMu ra and 
and semi-major axes (0.2AU, 0.4AU and 0.6AU). Assuming Kepler 
quality data, it is found that for the case where the photometric noise is 
white, physically realistic moons could be detected for gas giant host plan- 
ets, while for the case where the photometric noise is dominated by intrinsic 
stellar noise, filtering allows the detection of physically realistic moons for 
planets with mass 10Mj up . 
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Chapter 1 

Introduction 



1.1 Introduction 



Since the announcement of the first two extra-solar planets orbiting the pul- 



sar PSR B1257+12 flWolszczan k Frail[ |1992| ) , over 500 extra-solar planets 
have been discovered. These discoveries have been made using a variety 
of methods including the transit, radial velocity, microlensing and pulsar 
time-of-arrival techniques, and have consequently resulted in a broad vari- 
ety of detected planets. This wealth of data allows tests to be conducted on 
individual and ensemble groups of planets to investigate such things as the 



mode by which gas giant planets formed (e.g. Batygin et al. , 2009 Mardling 



2010), the conditions under which they formed and their subsequent orbital 
evolution (e.g. |Fabrycky & Winn} |2009[ |Triaud et al.[ |2010|). This is done 



through measurement and analysis of planetary properties such as mass, 
radius, orbital eccentricity and spin orbit misalignment, and by comparing 
limits on the size of extant moons with limits such as those proposed by 
Canup & Ward (2006). It is this last issue, in particular, the detection 
of extrasolar- moons, or moons of extra-solar planets to which this thesis 
addresses itself. 



1.2 Structure of this thesis 

In this thesis the detection of extra-solar moons, in particular, using the 
methods of pulse time-of-arrival perturbation and photometric transit tim- 
ing is investigated in the context of stability and formation models, as well as 

1 See e.g http://exoplanet.eu/catalogue.php 
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other moon detection methods presented in the literature. As such, this the- 
sis is divided into three main sections. The first section introduces the pre- 
liminary material required to understand this thesis, namely the notation, 
and literature results on formation, stability and detection of extra-solar 
moons. From this context, the second and third parts move on to analyse 
moon detection using the pulse time-of-arrival perturbation and photometric 
transit timing methods respectively. These parts are discussed in turn. 



1.2.1 Preliminaries and literature review 

We begin in chapter [2] with the task of discussing and defining the nota- 
tion used in this thesis. Then, the type of moons that extra-solar planets 
are expected to have is then summarised in chapter [3j In particular, work 
presented in the literature on moon formation mechanisms and subsequent 
moon orbital evolution is collated to give a set of likely constraints on the 
physical and orbital properties of moons of extra-solar planets. Then, in 
chapter [4j the set of methods proposed in the literature for detecting moons 
of extra-solar planets is summarised, along with the types of moon each 
method is optimised to detect. This is done in two main stages. First, 
the methods for detecting extra-solar planets are briefly summarised. Then, 
within this context, each of the moon detection methods presented in the 
literature is summarised with particular reference to the two methods inves- 
tigated in this thesis. 



1.2.2 Detection of moons of pulsar planets 

We begin our investigation of moon detection in Part [XT] by focussing on 
the pulse time-of-arrival perturbation technique. This investigation is con- 
ducted in two stages. First, a preliminary investigation is conducted into the 
detectability of moons of pulsar planets. This work is presented in chapter 
[5] and is published as Lewis et al. (2008). In this analysis, an expression for 



the timing perturbation due to planet-moon binarity is derived for the case 
where the orbit of the planet and the orbit of the moon are both circular, 
and in the same plane. This analysis is then used to constrain orbitally 
stable moons of the pulsar planet PSR B1620-26 b. Second, as an extension 
to this analysis, the effect of mutual inclination and mild eccentricity in the 
orbit of the planet or the moon is investigated in chapter [6j In particular, 
this investigation is conducted using a three-body formalism developed by 
my PhD supervisor, Dr. Rosemary Mardling, as it allows easy description 
of hierarchical three-body systems with arbitrary values of inclination and 
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eccentricity. It was found that mutual inclination or eccentricity introduced 
additional harmonics into the perturbation, which are summarised in fig- 
ure 16.81 



1.2.3 Detection of moons of transiting planets 



In Part III , the detectability of moons of transiting planets using the method 
of photometric transit timing is addressed. In chapter[7]the transit technique 
is reintroduced and expressions for the transit duration and shape of the 
transit light curve are derived as they are required later in the thesis. Then, 
in this context the photometric transit timing statistic r is introduced and 
defined, noting that for this thesis, a slightly more general definition of this 



statistic is used than that used by Szabo et al. (2006). Then, by algebraically 



manipulating this definition, expressions for At, the timing perturbation 
due to the moon, and €j the timing perturbation due to photometric noise, 
are derived. In particular, this formulation allows the effect of the moon 
and the noise to be investigated separately and then combined to yield 
thresholds. 

Following on from this, the form of Ar is investigated in chapter [8j In 
particular, it is investigated for the case where the moon's orbit is circular 
and in the same plane as the planet's orbit and the planet's orbit is circular 



and aligned to the line-of-sight (section 8.4), is circular and slightly inclined 



with respect to the line-of-sight (section 8.5) and eccentric and aligned to 



the line-of-sight (section 8.6). In addition, the case where the moon's orbit 
is slightly eccentric is also investigated (appendix [I]) . For the case where the 
moon's orbit is circular, the motion of the moon is roughly uniform during 
transit and the velocity of the moon on its orbit is substantially less than 
the velocity of the planet-moon pair across the face of the star, Ar is given 



by a sinusoid with coefficients given in table 8.4 



In chapter [9j the quantity €j, the timing noise on r, is examined for three 
realistic photometric noise sources, white noise, intrinsic stellar photometric 
noise and filtered intrinsic stellar photometric noise. First, the case of white 
noise is investigated analytically and compared with the qualitative results 
given in Szabo et al. (2006). Using this approach it is found that the size of 



does not necessarily decrease with decreasing exposure time as suggested 



by Szabo et al. (2006), but also depends on the origin of the photometric 



noise, for example, whether the dominant noise source is shot noise or read 
noise. Following on from this, using a method developed in this thesis for 
deriving the distribution of €j using out-of-transit data, the cases of realistic 
stellar noise and filtered stellar noise are investigated using SOHO light 
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curves. It is found that e,- is approximately normal and uncorrelated for 
all planets likely to host large moons. In addition, it is found that for 
the case where the photometric noise is dominated solar-like photometric 
variability, the amplitude of the timing noise is much larger than that for 
the equivalent amplitude white noise, and that this effect is only partially 
negated by filtering. 



Finally in chapter 10, the work on the form of At and behaviour of 
the timing noise is combined to produce preliminary detection thresholds. 
In particular, this is done using the method of generalised likelihood ratio 
testing, which involves comparing the probability that an observed sequence 
of t values was produced by a system containing a planet and a moon as 
opposed to a system containing only a planet. Using this method, analytic 
expressions for thresholds were generated for the case where the number of 
transits is large. It is found that the thresholds have a lopsided U-shape with 
minima defined by the type of photometric noise and inclination, and depth 
defined by the type of photometric noise and the eccentricity. In addition 
to this general trend, the threshold also shows a number of non-detection 
spikes corresponding to the cases where the moon orbits its host an integer 
number of times per planetary year, and consequently produces no transit 
to transit timing variations. To investigate the more realistic case where the 
number of observed transits is small, a Monte Carlo simulation is also run. 
The thresholds produced showed the same general behaviour as predicted 
by the analytic analysis. In addition, it is shown that it may be possible 
to place limits on physically realistic moons for gas giants hosts using this 
technique, for planets in the Kepler data set. 

We begin this process by discussing and selecting notation to be used 
for this thesis. 
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Chapter 2 

Notation used in this thesis 



2.1 Introduction 



Before moon detection methods can be investigated, or even literature re- 
sults summarised, a notation set must be defined and described. As notation 
used varies across the literature, the selection of notation is not necessar- 



ily straight-forward (for an extreme example of this please see section 2.4). 
Consequently, this chapter is dedicated to discussing and motivating the se- 
lection of the notation used in this thesis in three particular contexts. First, 
general notation describing the physical properties of the star, planet and 
moon will be discussed. Then, the notation required to describe the orbital 
properties of the star, planet and moon will be motivated and discussed in 
the context of two-body and three-body theory. Finally, the discussion will 
move to the notation required to describe transit light curves. For reference, 
the notation selected is presented and summarised in appendix |A} We begin 
with a discussion of general notation, in particular, describing the physical 
properties of the star, planet and moon. 



2.2 Notation used for the physical properties of 
the star, planet and moon 

For both of the two detection methods investigated in this thesis, there are 
three bodies which need to be described, namely, the star, the planet and 
the moon. These three bodies have a number of inherent physical properties 
which can effect moon detection, in particular, their mass and radius. For 
this thesis, the notation M s , M p and M m will be used for the masses where 
it is noted that the subscripts s, p and m denote properties of the star, 
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M 2 




O 

Figure 2.1: Schematic diagram of two masses (grey circles) along with the 
position vectors to these masses from the origin (O). The vector from Mi 
to M2, r, is also shown. 



planet and moon respectively. In addition, the radii of the three bodies will 
be written as R s , R p and R m (please see section 
about this choice). 



2.4 



for additional discussion 



2.3 Notation used for the orbital properties of the 
star, planet and moon 

The selection of notation for the orbital parameters of the star, planet and 
moon is a little more involved as they are part of a three-body system. Con- 
sequently, to provide a context for this discussion, the motion of two bodies 
in their mutual gravitational field will be discussed, followed by a general 
discussion of the motion of three bodies. Then, the understandings devel- 
oped from this discussion will then will be applied to the specific case of a 
star-planet-moon system, and used to define and select intuitively reasonable 
notation. We begin with a discussion of general two body motion. 



2.3.1 General two-body motion 

Consider the motion of two bodies, of mass Mi and M2, moving under 
their mutual gravitational field (see figure refTwoBodySchematic). Using 



Newton's force law, the forces acting on each of the bodies shown in figure 2.1 



8 




Figure 2.2: Schematic diagram showing the relationship between an elliptical 
orbit (dashed line), its normal (n), the reference direction and reference 
plane, and the angles /, U) and fl. 



can be expressed as 



Mi 



M 2 



d 2 Y X _ GM1M2 

d 2 r 2 
dt 2 



|r 2 - ri| 

GM 2 M 1 

I |3 



(r2 - ri), 



(ri - r 2 ), 



(2.1) 
(2.2) 



where G is the universal gravitational constant. 

Putting r = r 2 — ri, these two equations combine to give 



d 2 r G(Mi + M 2 ) . 
di 2 r 2 



0, 



(2.3) 



where r is also shown in figure 2.1, f is a unit vector in the direction of 
r, and one of the equations has dropped out as a result of conservation of 
momentum. 



Equation (2.3) can be directly solved to give the canonical conic section 



solutions for two-body motion (see e.g. Murray Sz Dermott, 1999). In par 



ticular, the orbit is described by six parameters which correspond to the six 
integration constants from the above second order vector differential equa- 
tion. Following the notation of Murray & Dermott (1999), the six orbital 



elements which uniquely define the ellipse of a two body orbit are a, the 
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o 

Figure 2.3: Schematic diagram of three masses (grey circles) along with the 
position vectors to these masses from the origin (O). In addition, the two 
Jacobian coordinates r and R are shown. 



semi-major axis, e, the eccentricity, /, the inclination, u, the argument of 
pericenter, Q, the longitude of the ascending node, and /(0), 1 the value of 
the true anomaly at t = (see figure 2.2). 

Now that two body motion has been introduced, we are in a position to 
discuss motion of three bodies in their mutual gravitational field, again from 
a general perspective. Consequently, we will repeat the above procedure for 
the case where an additional body has been added. 



2.3.2 General three-body motion 



We begin by deriving the equations of motion for three massive bodies. 
Consider three bodes of mass M\, M2 and M3 moving under the action 



of their mutual gravitational fields (see figure 2.3). Again, using Newton's 



force law, the forces acting on each of the bodies shown in figure 2.3 can be 



1 In order to determine where the body is along its orbit, an initial condition is required. 
Murray & Dermott ( 1999 1 use r, the time of pericenter passage to link t and /. However, 
as r will be used for the photometric transit timing statistic in this thesis, we use /(0) 
instead. 
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expressed as 



Mi 



Mo 



M 3 



d 2 rx 


GM 1 M 2 


dt 2 


|r 2 -ri| 3 


d 2 v 2 


GM 2 M 1 


dt 2 


1 |3 

|ri - r 2 | 


d 2 v 3 


GM 3 M 1 


dt 2 


|ri - r 3 | 3 



(r 2 - ri) + -3 (r 3 - ri) 

|r 3 - ri| 

GM 2 M 3 

(ri - r 2 J + -3 (r 3 - r 2 ) 

|r 3 - r 2 | 

GM 3 M 2 

(ri - r 3 ) + -3 (r 2 - r 3 ) 

T2 - r 3 | 



(2.4) 
(2.5) 
(2.6) 



Equations (2.4) to (2.6) can be rewritten in terms of Jacobian coordi- 
nates, that is, r, the vector from Mi to M 2 , and R, the vector from the 
center of mass of Mi and M 2 to M 3 , defined as 



r 2 



R 



ri, 
Mi 



Mo 



Mi+M 2 



ri 



Mi +Mi 



r 2 + r 3 , 



(2.7) 
(2.8) 



where r and R are shown in figure 2.3 



Rewriting equations (2.4) to (2.6) in terms of r and R gives 
d 2 r 



dt 2 



, GM 12 
H — r 



GM 3 



+ 



GM 3 



R 



Mi 
M, 2 



R 



Mi 
Ma"' 



(2.9) 



d 2 R GM 123 „ GM 123o 

H ^5 1 1 = ^ XV 



GMi 23 



(it 2 



i? 3 



fl 3 



M 2 
M12 J 



M 2 
Mi 2 



GMi 23 



R 



Mi 
Mi 2 



R 



Mi 
Ma" 1 



(2.10) 



where M 12 is defined as Mi + M 2 , M i23 is defined as Mi + M 2 + M 3 and 
where, again, one of the equations has dropped out as a consequence of 
conservation of momentum. 



Equations (2.9) and (2.10) can be simplified further by writing the right 



hand sides in terms of a function known as the disturbing function. Following 



Mardling (2008b) we define the disturbing function, TZ, to be 
GM U M 3 GM 2 M 3 GMiM 3 



TZ 



R 



+ 



IR 



Mi 
Mj 2 



+ 



|R + M^ r \ 
l**-^ Mi 2 r l 



(2.11) 
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where we note that this definition of the disturbing function has units of 
energy. Simplifying equations (2.9) and (2.10) using equation (2.11) gives 



M x M 2 Sy GM\Mi „ 
M12 dfi + V 2 F 

M 12 M 3 d 2 R G (Mi + M 2 ) M 3 ~ 
H ^ K, 



dr ' 



M 123 eft 2 



i? 2 



dU 



(2.12) 
(2.13) 



where r = |r| and -R = |R|, where f and R are unit vectors in the directions 
of r and R respectively. 

In addition, r = r(x, y, z) where the xyz coordinate system has its origin 
at Mi and R = R(-X", Y, Z) where the XYZ coordinate system has its origin 
at the center of mass of Mi and M 2 , and thus 

d . d . d d 
or ox oy oz 
d . d . d d 
&R =1 dX +i dY + 8Z' 

For the case where the three-body system is hierarchical, that is, the or- 
bit of Mi and M 2 about their common barycenter and the orbit of M 3 and 
the Mi-M 2 barycenter about the system barycenter are described by per- 
turbed two-body motion, these equations have physical meaning. This can 



be seen in the structure of the equations. Conceptually, equations (2.12) and 



(2.13) have two components. The first component consists of the left hand 



sides of equations (2.12) and (2.13). If the right hand side of the equations 



were neglected, then both equations would be mathematically equivalent to 



equation (2.3), the equation for two-body motion. Consequently, for this 
case, the motion of Mi and M 2 around the Mi-M 2 barycenter, the "inner" 
orbit, would be described by conic sections. In addition, the motion of M 3 
and the Mi-M 2 barycenter around the system barycenter, the "outer" or- 
bit, would be described by different conic sections. The second component 



consists of the terms on the right hand sides of equations (2.12) and (2.13). 
These terms allow the "inner" and "outer" orbits to interact. 

Now that general-three body theory has been introduced, we are finally 
in a position to apply it to the case of a star-planet-moon system. 



2.3.3 Three-body motion for the case of a star, planet and 
moon 

For the case of a planet-moon pair in orbit around a star, we can associate 
Mi with the planet, M 2 with the moon and M 3 with the star. For this 
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Figure 2.4: Schematic diagram of a star-planet-moon three-body system. As 



in figures 2.1 and 2.3, the masses are shown as grey circles and labeled by 
subscripts. In addition, the two Jacobian coordinates are shown and written 
as r m and r p (see text). 



case, the "inner orbit" would describe the elliptical orbit of the planet and 
moon about their common barycenter while the "outer orbit" would describe 
the elliptical orbit of the planet-moon pair, and the star, about the system 
barycenter. While the terms "inner" and "outer" are general, they are not 
very intuitive, and it is not immediately obvious how the orbital elements of 
these orbits should be labeled. Consequently, for this thesis we will call the 
"inner" orbit, the "moon's" orbit and label the vector and orbital elements 
associated with it with a subscripted m, and call the "outer" orbit, the 
"planet's" orbit and label the vector and the orbital elements associated 
with it with a subscripted p. Consequently, using this notation, the two 
governing equations are given by 

M m M p d 2 r m GM m M p A _ dU 
M p + M m dt 2 + r 2 m Vm dvj [Z - LV 
(M p + M m )M s d 2 v p G{M p + M m )M s „ _ &R 



M s + M p + M m dt 2 r 2 p dr 



where r m and r p are shown in figure 2.4 and where 



G{M p + M m )M s GM m M s GM p M s 

n = v \ m r + i h r- ( 2 - 16 ) 

r„ r- p t r -I r 

While this notation is being used, it should be pointed out that the re- 
sults of this work are still entirely general, in that the ratio of the moon's 
mass to the planet's mass can freely range from zero to one. For exam- 
ple, if the semi-major axis of the orbit of the moon about the planet-moon 



13 




Time 



Figure 2.5: Diagram showing the different portions of the transit light curve. 
The four shaded circles show the planet's position across the face of the star 
at the beginning and end of ingress, and the beginning and end of egress. 
As the position of the planet along the chord of the star is a linear function 
of time, it can be used as a proxy for time. Consequently the position of 
the planet and the value of the light curve resulting from that position are 
linked by dashed lines. 



barycenter is required, it will be written as (M p /(M p + M m ))a m . Simi- 
larly, the semi-major axis of the orbit of the planet about the planet-moon 
barycenter is given by (M m /(M p + M m ))a m . Recalling that the six orbital 
elements which uniquely define the ellipse of a two body orbit are a, e, I, 
u, f2, and /(0), we have that, the orbital elements of the planet's orbit are 
given by a p , e p , I p , uj p , fl p and f p (0), while the orbital elements of the moon's 
orbit are given by 

Oimi &m, Imi U) m , &>m and / m (0). 

In addition to general notation defining the physical and orbital proper- 
ties of the star, planet and moon, there is one addition situation where ad- 
ditional notation is required, namely, the description of transit light curves. 
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2.4 Notation specific to transit light curves 



A planetary transit occurs when a planet passes between the observer and 
the face of its host star, and consequently blocks some of that star's light, 
and a transit light curve is the measured luminosity of a star undergoing a 
planetary transit. A schematic of a sample transit light curve is shown in 
figure |2.5| As can be seen in figure |2.5[ the transit itself consists of three 
main stages, first the ingress, where the disk of the planet is passing onto 
the face of the star, second, the main part of the transit, where disk of the 
planet fully overlaps with the disk of the star, and third, the egress, where 
the disk of the planet is passing off the face of the star. 

In particular, the shape of the light curve contains information about the 
planet and its orbit, for example, the duration of ingress and egress contains 
information about the size of the planet and the inclination of the orbit, and 
the dip depth tells us about the relative size of the planet compared to the 
star. In addition to information about the planet, the shape of this light 
curve also determines the effectiveness of detection of moons using both the 
method analysed in this thesis, as well as other methods in the literature. 
So, in order to investigate these methods, we need to be able to describe the 
shape of the transit light curve. 

Consequently, the question arises of what notation to use to describe 
this transit light curve. In particular, we would like a notation set which is 
in general use, self-consistent and optimised for describing moon detection. 
These issues will be addressed in turn, and then used to decide on a notation 
system. 

From a practical perspective we would like a notation set which is easy 
to understand. This can be partially ensured if it is already currently in 
general use in the transiting planet literature. When considering this goal, 
two questions naturally arise, "What notation is currently in general use in 
the transiting planet community?" and "What do we mean by transiting 
planet community?" . To address the first question, a literature review was 



conducted, focussing on notation styles. For reference, table 2.1 shows a 
representative, but by no means a complete list of notation styles in use. As 
can be seen from table [2~T| while there are some trends, for example, transit 
duration is generally represented by a t or a T while radii are represented by 
a R or r with a subscript, 2 there is no universally accepted notation style. 
The origin of this broad range of notation styles, can partially be understood 



2 Recall that for this thesis it was decided to use R s , R P and R m for the radius of the 
star, planet and moon respectively. 
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in terms of the answer to the second question. Within the transiting planet 
community (and the list of works in table 2.1 ), there are authors who are in- 
terested in the shape of transit light curves, to help streamline the detection 
process (e.g. Tingley & Sackett, 2005), to determine properties of the planet 



and star (e.g. Seager & Mallen-Ornelas 2003) or because they are related 



to eclipsing binary light curves (e.g. Gimenez 



2006). In addition, a number 



of authors are not interested in the shape of the light curve per se, but more 
in perturbations in the mid-time or duration of transits as they can be used 
to detect additional bodies such as planets (e.g. Holman & Murray, 2005) 
or even moons (e.g. Kipping et al. 2009). From within these groups, nota- 



tion clusters start to emerge, for example, Kipping et al. (2009) and Carter 



et al. (2008) use the similar notation because Kipping et al. (2009) adopted 



some of their notation from Carter et al. (2008). In addition, work from 
researchers from other fields carries with it the notation used in those fields, 
for example, the notation of Gimenez (2006) is inherited from the work of 



Kopal (1979) on binary star eclipse light curves. 



In addition to selecting notation which is generally understood, it must 
also be self-consistent and also consistent with the notation already in use 
for this thesis. We will consider the issues of self-consistency and being 
consistent with the notation already in use, in turn. As none of the notation 
sets contain all the variables required for this work, a hybrid, self-consistent 
notation, combining two or more of these sets must be constructed. As can 
be seen from table 2.1 if some sets of notation were combined, it would 
lead to confusion. For example, consider a combination of the light curve 



notation of Carter et al.| ( 2008) with the geometric notation of 
As Carter et al. (2008) use S for the depth of the dip and 



2006 



2006 



Gimenez 



Gimenez 



use 5(t) for the distance between the planet and the star on the plane 
of the sky, the variable 5 is not uniquely defined. In addition to selecting 
a self-consistent set of notation, we would also like a notation set which 
is compatible with the notation discussed in the previous section and with 
that used in the literature with respect to photometric transit timing. For 
example, the notation used by Carter et al. ( |2008[ ) cannot be used in this 
work as the symbol they selected for the duration of transit ingress, r, is the 
same symbol used by Szabo et al. (2006) to describe their timing statistic. 

In addition to selecting comprehensible self-consistent notation, the nota- 
tion selected should be able to easily describe the transiting moon detection 
method we are focussing on in this thesis, photometric transit timing, but 
also be able to describe moon detection methods in general for comparison 
purposes. The first issue to address is the different ways that different works 
define light curve parameters, for example Seager Sz Mallen-Ornelas (2003) 
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define the transit depth to be constant, while Mandel & Agol (2002) allow it 
to be a function of time, and Carter et al. ( 2008 ) define the transit duration 



as lasting from middle of ingress to the middle of egress, while Tingley & 



Sackett (2005) define the transit duration as lasting from the beginning of 



ingress to the end of egress. Consequently, we need to select the set of light 
curve parameter definitions which most easily describe the quantities that 
we want. 

For the case of photometric transit timing we will need to sum over the 
transit depth for each exposure in the transit, as well as other exposures 
outside the planetary transit (see equation (7.46)). As we will see in sec- 
tion 4.3.2, moons lead to additional dips in the transit light curve, with the 
dips caused by the moons stretched or compressed and translated from that 
of their host planet. To describe these lumpy light curves we need a notation 
which allows the transit depth to be a function of time. In particular it was 
decided to describe the geometry of the transit and the transit depth using 
the notation of Gimenez (2006), partially as it is one of the notation styles 
to describe the dip depth as a function of time, and partially as it allows 
easy access to a range of mathematical methods, for example, describing dip 
depth as a surface integral. Consequently for this thesis, the transit depth 
is given by a(t), with a p (t) and a m (t) representing the portion of the dip 
resulting from the transit of the planet and moon respectively. In addition 
5 p (t) and 5 m (t) represent the projected distance between the center of the 
star and the planet and moon respectively, while 5 m i n represents the impact 
parameter for the transit of both the planet and the moon, as both values 
are approximately the same for all cases considered in this thesis (see sec- 
tion 8.2.1). In addition to describing the transit depth, the duration of the 



transit of the planet and moon must also be described. 

Recalling the range of definitions of transit duration given in table 2.1 it 
can be seen that in order to describe this quantity, both a definition and ap- 
propriate notation must be selected. For this thesis it was decided to define 
the transit as beginning and ending when the center of the body passes onto 
and off the stellar limb, and the transit duration as the difference between 
these two times, that is, the definition shown in column 2 of table [2~T| This 
particular definition was selected to reduce the algebraic complexity of the 
expressions derived in chapter [8| by ensuring that transit duration did not 
depend on the radius of the transiting body. Informed by the notation styles 
presented in table 2A_ it was decided to represent the transit duration by 
Tt r a, with Tt ra describing the transit duration of a planet with no moon, 
and with Tt ra ,p and Tt ra ^ m denoting the duration of the planet's and moon's 
transit for the case of a planet with a moon. 
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In addition, for the case of moon detection using other methods, other 
light curve parameters also need to be described. In addition to being able 
to describe the dip due to the moon and the transit duration, quantities for 
which notation has already been selected, it would also be useful to be able to 
describe the duration of ingress and the transit mid-time. Unfortunately, the 
only work which explicitly describes the duration of ingress and egress, uses 
the notation r, which we cannot use as it is the test statistic for photometric 
transit timing. Consequently, the notation selected above was extended 
such that Ti n describes the duration of ingress. Again T« n is understood 
to describe the duration of ingress for the case of a planet with no moon, 
and with Tj njP and Tj ni?n denote the duration of ingress for the case of the 
transit of the planet and moon the case of a planet with a moon. Finally, 
the mid-time of the transit is given by t m id, where again t m id represents 
the transit mid-time for the case of a planet with no moon, while t m id,p and 
tmid,m represent the transit mid-times for the transit of the planet and moon 
respectively for the case of a planet and a moon. In addition, for the th 
transit, t m id may also be written as to- F° r reference each of these decisions 
is summarised in table 2.1 and also given in appendix |A} 



2.5 Conclusion 

The notation that will be used for this thesis has been discussed in the 
context of three broad areas. First notation for the physical properties of 
the star, planet and moon was discussed. Second, the notation required 
to describe the orbital motion of the star, planet and moon was discussed 
in the context of three-body theory. Finally, the notation required for the 
description of the transit light curve was selected. In addition, for reference, 
the notation selected is summarised in appendix [A] Now that the issue of 
notation has been discussed and a framework decided, we can move on to 
start to discuss the literature, in particular, the types of moons that are 
likely to exist. 
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Chapter 3 

Constraints on extra-solar 
moons 

3.1 Introduction 

Before considering the types of moons able to be detected, and the mechanics 
of moon detection, it would be useful to have an understanding of the types 
of moons extra-solar planets are likely to possess, and in particular, the 
properties of any large (and consequently detectable moons). First, we will 
begin this investigation by looking at the census of moons present in the 
Solar System. Second, guided by these results, the characteristics of moons 
predicted to form will be summarised. Third, the ways in which tidal and 
three body effects can modify moon orbits will be discussed. Finally, these 
sources of information will be combined to provide an indication of the types 
of moons that extra-solar planets are likely to possess. 

3.2 Characteristics of moons in the Solar System 

We begin our discussion of likely moon properties by summarising the prop- 
erties of the moons that we know about, the moons in the Solar System. As 
planets in the Solar System are divided into two distinct types, terrestrial 
and gas giant, according to their planet-Sun distance, composition and for- 
mation history, the properties of moons of terrestrial planets and gas giants 
will be discussed separately. We begin with a discussion of the moons of 
terrestrial planets. 
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"Planet" 


"Moon" 






M m 






Cm 








(km) 


(10 20 kg) 




(M p ) 




(10 3 km) 


n 




Earth 


Moon 


1738 


734.9 




0.0123 




384.4 


0.0554 


5.16 


Mars 


Phobos 


11* 


1.1 x 10" 


4 


1.7 x 10- 


-8 


9.376 


0.0151 


1.08 




Deimos 


6* 


1.8 x 10" 


5 


2.8 x 10" 


-9 


23.458 


0.0002 


1.79 


Pluto 


Charon 


593 


15 




0.118 




17.536 


0.0022 


0.001 




Nix 




5.8 x 10- 


3 


4.6 x 10" 


-5 


48.708 


0.0030 


0.20 




Hydra 




3.2 x 10- 


3 


2.5 x 10- 


-5 


64.749 


0.0051 


0.21 



Table 3.1: Physical and orbital properties of the satellites of the Earth, 
Mars and Pluto. "Large" satellites are shaded grey. Radii and masses for 



the Moon, Phobos, Deimos and Charon are taken from (Murray <__ Der 



mott 


1999) 


et al. 


2008 



Orbital parameters were taken from the JPL website. The 



inclinations are measured relative to the local Laplace plane. 



3.2.1 Moons of terrestrial planets 

Of the four terrestrial planets in our Solar System, only the two most distant 
planets, Earth and Mars, host moons, and out of these two planets only the 
Earth hosts a relatively large moon. In addition, a number of planet-like 
objects, in particular Pluto, also host "moons". For the case of Pluto, this 
includes its large moon Charon, and its two much smaller moons, Nix and 
Hydra. For reference and comparison, the satellite systems of Earth, Mars 



and Pluto are shown in figure 3.1 and their properties are summarised in 
table 3.1 As we are discussing moon properties within the context of moon 
detection, we will focus our attention on large (and consequently detectable) 
satellites such as the Moon and Charon, and neglect the irregular moons of 
Mars and Pluto's small moons Nix and Hydra. 

Focussing our attention on the Earth-Moon system and the Pluto-Charon 
system, it can be seen that they share a number of properties. First, in both 
of these cases there is only one large moon per satellite system. Second, this 
single large moon contains a fair percentage of the mass in the planet-moon 
system, ~1% for the case of the Moon and ~10% for the case of Charon. 



1 http://ssd.jpl.nasa.gov/7sat_elem. 
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I u 

Semi-major axis (AU) 
(a) Earth's satellite system. 




Semi-major axis (AU) 
(b) Mars' satellite system. 




Semi-major axis (AU) 
(c) Pluto's satellite system. 



Figure 3.1: Schematics of the satellite systems of the Earth, Mars and Pluto. 
Large satellites, in particular, the Moon and Charon, are shown as large filled 
dark grey circles with radii proportional to the radius of the corresponding 
satellite. For reference, the Moon, has a radius of 1737km. Satellites too 
small to have a spherical shape, are shown as small dots. The dashed line 
on the right denotes half a planetary Hill radius. 
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Third, the orbit of this moon about their host planet is relatively close and 
relatively circular. Finally, in both cases, the moon's orbit is approximately 
aligned with the planet's equator e.g. the Moon's orbit is tilted by only 5 
degrees. These shared properties suggest a common formation mechanism 



for these large moons, an issue that will be discussed in section 3.3.1 



3.2.2 Moons of gas giant planets 

Unlike the terrestrial planets, each of the four gas giant planets in our Solar 
System has a plethora of attendant moons with a grand total of 62, 59, 27 
and 13 moons for Jupiter, Saturn, Uranus and Neptune respectively, as of 
the 12 th of February 2010. For reference, the satellite systems of Jupiter, 



Saturn, Uranus and Neptune are shown in figure 3.2, while their properties 



are summarised in tables 3.2, 3.3 3.4 and 3.5, What is immediately obvious 



from figure 3.2 is that there are three main classes of moon. First, there 
are small inner moons with orbits very closely aligned to their host planet's 
equator. These moons seem to be generally associated with, or help shape 
the ring systems of their host planet. Second, outside this set of inner moons 
each of the gas giant planets has a number of larger, regular moons. Finally, 
further away still, are irregular moons with orbits which become increasingly 
retrograde the further they are from their host planet. Again noting that 
the aim of this investigation is gain an intuitive understanding of the the 
types of satellites that are likely to exist (and consequently be detected) , we 
will focus this discussion on the regular satellites, as they are the largest, 
and most detectable. 



From figure 3.2 it can be seen that the regular satellites of the gas giant 
planets all share a number of features, with one main exception, Triton, 
Neptune's only large moon. Consequently, the satellites of Jupiter, Saturn 
and Uranus will first be discussed, followed by a separate discussion of the 
properties and peculiarities of Triton. 

The regular satellites of Jupiter, Saturn and Uranus share a number of 
features. First, these large moons only occur within a minimum distance 
from their planet, compared to the irregular moons for example, which oc- 
cupy orbits reaching to the edge of the stability region, estimated by half a 
Hill radius. Second, a planet can have multiple large moons e.g. the four 
Galilean satellites of Jupiter. This is in stark comparison with case for ter- 
restrial planets where one large moon (if any) per planet seems to be the 
norm. Third, while the mass of these satellites is comparable to that of the 
Moon, it is small in comparison to the mass of the host planet, in particular, 
the proportion of the planet-moon system's mass that is in regular satellites 
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Semi-major axis (AU) 
(a) Jupiter's satellite system. 




Semi-major axis (AU) 
(b) Saturn's satellite system. 




10 10 

Semi-major axis (AU) 

(c) Uranus' satellite system. 




Semi-major axis (AU) 

(d) Neptune's satellite system. 

Figure 3.2: Schematics of the satellite systems of the four gas giants in 
the Solar System. The regular satellites are shown as large filled dark grey 
circles with radii proportional to the radius of the corresponding satellite. 
For reference, Titan, the largest satellite of Saturn, has a radius of 2575km. 
Satellites too small to have a spherical shape, that is, inner satellites and 
irregular satellites, are shown as small dots while planetary rings are denoted 
by a thick light grey line. The surface of the planet is represented by a thick 
vertical line on the left, while on the right a dashed line denotes half a 
planetary Hill radius. 
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is 2.1 x 10 -4 , 2.5 x 10 -4 and 1.1 x 10 for Jupiter, Saturn and Uranus 
respectively. Finally, the orbits of these moons are circular and aligned with 
the equator of their host planet. 2 A number of processes proposed which 



will produce such systems will be explained in section 3.3.2 



Compared to the satellite systems of Jupiter, Saturn and Uranus, Nep- 
tune's satellite system is distinctly odd. First, it only has one large moon, 
Triton, and second, this moon's orbit is inclined and retrograde. However, 
similar to the moons of Jupiter, Saturn and Uranus, Triton is also relatively 
close to its host planet and not very massive compared to it. As a result of 
these properties it has been proposed that it probably did not form in situ, 



but was captured. This process will be discussed further in section 3.3.3 



Moon 






M m 






Cm 






(km) 


(10 20 kg 


) (10" 4 M p ) 


(10 3 km) 






n 


Metis 


20 






128 


0.116 


0.001 


0.019 


Adrastea 


10 






129 


0.116 


0.002 


0.054 


Amalthea 


86* 






181.4 


0.164 


0.003 


0.380 


Thebe 


50 






221.9 


0.200 


0.018 


1.080 


Io 


1821 


893.3 


0.4705 


421.8 


0.381 


0.004 


0.036 


Europa 


1565 


479.7 


0.2527 


671.1 


0.606 


0.009 


0.466 


Ganymede 


2634 


1482 


0.7806 


1070.4 


0.966 


0.001 


0.177 


Callisto 


2403 


1076 


0.5667 


1882.7 


1.699 


0.007 


0.192 


Themisto 








7507 


6.775 


0.242 


43.07 


Leda 


5 






11165 


10.08 


0.164 


27.46 


Himalia 


85 






11461 


10.34 


0.162 


1.438 


Lysithea 


12 






11717 


10.57 


0.112 


28.30 


Elara 


40 






11741 


10.60 


0.217 


143.6 


Carpo 








17078 


15.41 


0.444 


51.16 


S/2003J12 








17835 


16.10 


0.488 


150.8 


Euporie 








19339 


17.45 


0.144 


145.5 


S/2003J3 








20230 


18.26 


0.203 


147.8 


Continued 


on next 


page . . . 













2 Uranus has an orbital obliquity of approximately 98 °, and consequently its equatorial 
and orbital planes are very different. The inclinations shown in figure [372] and presented in 
table [3T4] are measured relative to the local Laplace plane, the plane which defines the axis 
about which the moon's orbit normal precesses. This plane is approximately parallel with 
the equatorial plane of the planet for close in satellites, e.g. the regular satellites, and is 
approximately aligned with the orbital plane of the planet for more distant satellites, e.g. 
the irregular satellites. 
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Table 3.2 - Continued 
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0.102 


146.0 


O / anno t -i r* 

D/2003J16 








20948 


18.91 


n ■ v > i 

0.231 


1 A O d 

148. b 
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OI 1 JO 

21148 


19.09 


n ion 

0.230 


149.0 
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1 nelxmoe 
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0.219 


i r 1 o 

151.3 


ml 

1 nyone 








21192 


19.13 


0.238 


148.8 


Iocaste 








21272 


1 n on 

19.20 


n oi r 

0.215 


i A r\ a 

149.4 


A 1 

Ananke 


1 n 

10 






21276 


19.20 


0.244 


148.9 


TT 

Hermippe 








21300 


19.22 


0.212 


150.9 


S/2003J15 








O O / * O ■ "i 

22622 


20.42 


0.187 


146.4 


o /onno Tin 
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23042 


on on 

20.80 


n /i oo 

0.428 


i r* r o 

165.2 


Pasithee 








23090 


on o a 

20.84 


0.267 


1 r* r* n 

165.0 


Eurydome 








23148 


in en 
20.89 


n 17£; 
0.2/6 


i £n o 
150.2 


Lnaldene 








001 ^n 

23179 


on no 
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n on 

0.251 


165.2 


Isonoe 








23231 


on nT 

20.97 


O O A i"7 

0.247 


165.3 


Kallichore 








0Q07Q 

ZoZ f o 


o 1 nn 
21. UU 


n O A O 


1ftC 1 


Erinome 








23283 


21.01 


n o/?/? 
U.2DD 


io4.y 


Kale 








OQQAO 


OI HQ 

zl.Uo 


n or;o 
U.zoz 


lDO.1 


Aitne 








23315 


o i r\ a 

21.04 


n o/T 1 

0.2bb 


165.1 


Eukelade 








23322 


21. U5 


n o/?'7 


165.2 


Arche 








23355 


o i no 

21.08 


n oc;r 

0.256 


164.9 


Taygete 








23363 


21.09 


0.252 


165.2 


S/2003J9 








23385 


21.11 


0.264 


165.2 


Carme 


15 






23404 


21.12 


0.253 


164.9 


Herse 








23405 


21.12 


0.249 


164.8 


S/2003J5 








23493 


21.20 


0.246 


165.3 


S/2003J19 








23532 


21.24 


0.262 


165.2 


S/2003J23 








23549 


21.25 


0.270 


146.3 


Kalyke 








23564 


21.27 


0.246 


165.2 


Hegemone 








23566 


21.27 


0.344 


154.0 


Pasiphae 


18 






23624 


21.32 


0.409 


170.5 



Continued on next page . . . 
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Table 3.2 - Continued 



Moon 


Rm 


M m 


CI )ji 






Ijrj i 




(km) 


(10 20 kg) (l(T 4 M p ) 


(10 3 km) 


(R c ) 




(°) 


Cyllene 






23787 


21.47 


0.418 


150.2 


Sponde 






23790 


21.47 


0.313 


151.2 


Magaclite 






23808 


21.49 


0.421 


152.8 


S/2003J4 






23928 


21.60 


0.356 


149.3 


Sinope 


14 




23939 


21.61 


0.250 


158.1 


Aoede 






23969 


21.63 


0.432 


158.3 


Autonoe 






24033 


21.69 


0.317 


152.3 


Callirrhoe 






24102 


21.75 


0.283 


147.2 


Kore 






24486 


22.10 


0.332 


145.0 


S/2003J2 






28332 


25.57 


0.411 


157.1 



Table 3.2: Physical and orbital properties of the satellites of Jupiter. The 
regular satellites are highlighted in grey. Radii and mass measurements are 
taken from (Murray & Dermott, 1999), while orbital parameters are taken 
from the JPL website. The inclinations are measured relative to the local 
Laplace plane. 



Moon 


Rm 




M m 










e, 


71 


Im 




(km) 


(10 20 kg) 


(10- 4 M P ) 




km) 


(Rc) 






( C 


) 


Pan 


10 






133 


58 





098 





000 





001 


Daphnis 








136 


5 





100 





000 





000 


Atlas 


16* 






137 


67 





101 





001 





003 


Prometheus 


50* 


0.0014 


2.5 x 10~ 6 


139 


38 





103 





002 





008 


Pandora 


42* 


0.0013 


2.3 x 10~ 6 


141 


72 





104 





004 





050 


Epimetheus 


59* 


0.0055 


9.7 x 10~ 5 


151 


41 





111 





010 





351 


Janus 


89* 


0.0198 


3.5 x 10~ 5 


151 


46 





111 





007 





163 


Mimas 


199 


0.385 


6.8 x 10" 5 


185 


54 





137 





020 


1 


574 


Methone 








194 


44 





143 





000 





007 


Pallene 








212 


28 





156 





004 





181 


Enceladus 


249 


0.73 


0.0013 


238 


04 





175 





005 





009 


Tethys 


530 


6.22 


0.0109 


294 


67 





217 





000 


1 


091 


Telesto 


11* 






294 


71 





217 





000 


1 


180 


Calypso 


10* 






294 


71 





217 





001 


1 


499 


Polydeuces 








377 


2 





278 





019 





177 
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Table 3.3 - Continued 



Moon 


R 




M m 












(km) 


(10 20 


kg) (10" 4 M„) 


(10 3 km) 


{R c ) 




(°) 


Helene 


16* 


" 




377.42 


0.278 


0.007 


0.213 


Dione 


560 


10.52 


0.0185 


377.42 


0.278 


0.002 


0.028 


Rhea 


764 


23.1 


0.0406 


527.07 


0.388 


0.001 


0.333 


Titan 


2575 


1345.. 




1221.9 


0.899 


0.029 


0.312 


Hyperion 


143* 






1500.9 


1.104 


0.023 


0.615 


Iapetus 


718 


15.9 


0.0280 


3560.9 


2.620 


0.029 


8.313 


Kiviuq 








11311 


8.323 


0.164 


48.53 


Ijiraq 








11367 


8.365 


0.458 


47.12 


Phoebe 


110 






12947 


9.528 


0.163 


175.2 


Paaliaq 








15024 


11.06 


0.540 


41.77 


Skathi 








15614 


11.49 


0.294 


150.8 


Albiorix 








16401 


12.07 


0.484 


35.51 


S/2007_S2 








16723 


12.31 


0.178 


175.6 


Bebhionn 








17117 


12.60 


0.484 


34.56 


Erriapus 








17611 


12.96 


0.468 


38.65 


Skoll 








17663 


13.00 


0.470 


160.2 


Tarqeq 








17909 


13.18 


0.119 


49.57 


Siarnaq 








18015 


13.26 


0.405 


44.51 


Tarvos 








18263 


13.44 


0.531 


35.95 


S/2004_S13 








18408 


13.55 


0.260 


169.1 


Hyrokkin 








18437 


13.57 


0.329 


151.2 


Greip 








18442 


13.57 


0.316 


173.3 


Mundilfari 








18667 


13.74 


0.205 


169.2 


S/2006_S1 








18797 


13.83 


0.118 


155.0 


S/2007J33 








18981 


13.97 


0.185 


175.7 


Bergelmir 








19338 


14.23 


0.142 


158.9 


Jarnsaxa 








19356 


14.24 


0.217 


163.3 


Narvi 








19417 


14.29 


0.426 


143.2 


S/2004.S17 








19449 


14.31 


0.181 


168.0 


Suttungr 








19476 


14.33 


0.114 


173.9 


Hati 








19775 


14.55 


0.373 


165.0 


S/2004_S12 








19867 


14.62 


0.323 


163.3 


Bestla 








20278 


14.92 


0.474 


141.7 


Farbaut 








20387 


15.00 


0.245 


158.0 


Thrymr 








20439 


15.04 


0.466 


173.7 
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Table 3.3 - Continued 



Moon 








Cm 


Im 




(km) 


(10 20 kg) (10- 4 M P ) (10 3 km) 


(Rc) 




(°) 


Aegir 




20749 


15.27 


0.252 


167.1 


S/2004_S7 




21005 


15.46 


0.530 


164.9 


Kari 




22077 


16.25 


0.484 


155.9 


S/2006_S3 




22100 


16.26 


0.404 


158.9 


Fenrir 




22454 


16.52 


0.133 


164.4 


Surtur 




22920 


16.87 


0.447 


169.1 


Loge 




23065 


16.97 


0.188 


167.2 


Ymir 




23140 


17.03 


0.334 


171.7 


Fornjot 




25151 


18.51 


0.210 


169.7 



Table 3.3: Physical and orbital properties of the satellites of Saturn. The 
regular satellites are highlighted in grey. Radii and mass measurements are 
taken from (Murray & Dermott, 1999), while orbital parameters are taken 
from the JPL website. The inclinations are measured relative to the local 
Laplace plane. 



Moon R m M m a m e m I m 



(km) (10 20 kg) (1Q- 4 M P ) (10 3 km) (R c ) Q 



Cordelia 


13 






49 


8 





034 





000 





085 


Ophelia 


16 






53 


8 





037 





010 





104 


Bianca 


22 






59 


2 





041 





001 





193 


Cressida 


33 






61 


8 





042 





000 





006 


Desdemona 


29 






62 


7 





043 





000 





113 


Juliet 


42 






64 


4 





044 





001 





065 


Portia 


55 






66 


1 





045 





000 





059 


Rosalind 


29 






69 


9 





048 





000 





279 


Cupid 








74 


392 





051 





001 





099 


Belinda 


34 






75 


3 





052 





000 





031 


Perdita 








76 


417 





052 





012 





470 


Puck 


77 






86 







059 





000 





319 


Mab 








97 


736 





067 





003 





134 


Miranda 


761 


30.14 


0.3471 


129.9 





089 





001 


4 


338 
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Table 3.4 - Continued 



Moon 


Rm 




M m 






Cm 


Im 




(km) 


(10 20 kg 


) (i(r 4 M p ) 


(10 3 km) 


W 




(°) 




Ariel 


235* 


0.659 


0.00759 


190.9 


0.131 


0.001 


0.041 


Umbriel 


579* 


13.53 


0.1558 


266 


0.182 


0.004 


0.128 


Titania 


585 


11.72 


0.1350 


436.3 


0.299 


0.001 


0.079 


Oberon 


789 


35.27 


0.4062 


583.5 


0.399 


0.001 


0.068 


Francisco 








4282.9 


2.932 


0.132 


147.3 


Caliban 








7231.1 


4.949 


0.181 


141.5 


Stephano 








8007.4 


5.481 


0.225 


143.8 


Trinculo 








8505.2 


5.822 


0.219 


167.0 


Sycorax 








12179 


8.336 


0.522 


159.4 


Margaret 








14146 


9.683 


0.677 


57.37 


Prospero 








16276 


11.14 


0.445 


151.8 


Setebos 








17420 


11.92 


0.591 


158.2 


Ferdinand 








20430 


13.98 


0.399 


169.8 



Table 3.4: Physical and orbital properties of the satellites of Uranus. The 
regular satellites are highlighted in grey. Radii and mass measurements are 
taken from (Murray & Dermott, 1999), while orbital parameters are taken 



from the JPL website. 
Laplace plane. 



The inclinations are measured relative to the local 



Moon 


Rm 


M m 






Cm 


Im 




(km) 


(10 20 kg) (W~ 4 M p ) 


(10 3 km) 


(Rc) 




(°) 


Naiad 


29 




48.227 


0.004 


0.000 


4.691 


Thalassa 


40 




50.074 


0.004 


0.000 


0.135 


Despina 


74 




52.526 


0.005 


0.000 


0.068 


Galatea 


79 




61.953 


0.005 


0.000 


0.034 


Larissa 


94* 




73.548 


0.006 


0.001 


0.205 


Proteus 


209* 




117.65 


0.010 


0.000 


0.075 


Triton 


1353 


215 2.10 


354.76 


0.031 


0.000 


156.9 


Nereid 


170 




5513.8 


0.475 


0.751 


7.090 


Halimede 






16611 


1.430 


0.265 


112.7 


Sao 






22228 


1.914 


0.137 


53.48 
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Table 3.5 - Continued 



Moon 




M m dm 




C-rn 


Im 




(km) 


(10 20 kg) (10- 4 M P ) (10 3 km) 


w 




(°) 


Laomedeia 




23567 


2.029 


0.397 


37.87 


Psamathe 




48096 


4.141 


0.381 


126.3 


Neso 




49285 


4.244 


0.571 


136.4 



Table 3.5: Physical and orbital properties of the satellites of Neptune. The 
regular satellites are highlighted in grey. Radii and mass measurements are 
taken from (Murray & Dermott, 1999), while orbital parameters are taken 



from the JPL website. 
Laplace plane. 



The inclinations are measured relative to the local 



3.3 Formation models 

Planetary formation is currently a vigorous area of research due partially 
to the exponential increase in computing power with time and partially to 
the high rate of discovery of extra-solar planets. Recent simulations suggest 
that formation mechanisms can place physical limits on the mass, number 
and orbital parameters of moons. As theories of moon formation are built 
on underlying theories of planet formation the limits on moon mass and 
orbital parameters will be discussed within the context of the method by 
which the host planet formed. 

According to current theories, planets generally produce/acquire large 
moons in the final stages of planet building. For the case of terrestrial 
planets, it is proposed that moon formation occurs during the chaotic growth 
phase of planetary formation. During this phase, it is believed that moon- 
sized embryos on eccentric orbits perturb and impact with each other. For 
the case of gas giants, it is believed that moon formation occurs during 
runaway growth, that is, when the proto gas giant becomes large enough to 
accrete gas directly from the protoplanetary nebula. The resulting moons 
are believed to form within the resulting circumplanetary accretion disk. 
Finally, for the case of captured moons, the eventual moon properties depend 
less on the mechanics of planetary formation and more on the population of 
objects capable of being tidally captured. 
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3.3.1 Impact generated moons 



Simulations of the period of chaotic growth, the phase when it is thought 
that large impact-generated moons formed, shows that giant impacts which 



are able to produce moons are common (e.g. Agnor et al. , 1999 ). Simulations 



of the impact process indicate that impacts between terrestrial mass proto- 
planets (M p < 2.5M^) produce a disk of orbiting debris ( Wada et al. , 2006) 
which can coalesce into a moon a couple of planetary radii from its host an 
orbit which can have any value of inclination. In addition, for standard sized 
terrestrial planets (0.5 - IMq), the dynamics of the impact and interactions 
between the debris during post impact evolution generally result in a single 
(Canup et al. 1999), large moon containing up to 4% of the planets mass 



(e.g. Canup et al. 2001 ). As a result of the random nature of the impact, this 
moon can have any initial orbital inclination. However, depending on this 
inclination, the dynamical evolution of the planet-moon system can result in 
re-impact, a moon on a close, inclined orbit, or a moon on a distant coplanar 
orbit ( |Atobe & Ida]|2007[ ). 

While only one of the four terrestrial planets in our Solar System has 
an impact generated moon, giant impacts have been invoked to explain 



Mercury's high density ( 


Benz et al. 


(Alemi & Stevenson 




2006 


). In addit 



1988) and Venus' retrograde rotation 



In addition, giant impacts have been proposed 



to explain the satellite system of Pluto (McKinnon, 1989 



and the high obliquity and satellite system of Uranus (Korycansky et al. 



1990 Slattery et al., 1992). 



Stern et al. 2006) 



3.3.2 Disk generated moons 

It is believed that the regular satellites of Jupiter, Saturn and Uranus formed 
within a circumplanetary disk. This disk may have been the accretion disk of 
its host planet as it accreted gas and solids from the protoplanetary nebula 
or, for the case of Uranus, the disk could possibly have been one that was 



stochastically generated, by a giant impact (Korycansky et al. , 1990, Slattery 



et al. 1992). Independent of the source of the disk, satellite growth within 
a disk explains the circular orbits of the regular satellites and their low 
inclination with respect to planetary rotation. However, the specifics of the 
method by which a disk of gas and solids is processed into a small number of 
large satellites is not fully understood. In addition to the moon properties 
naturally resulting from accretion from a disk, any proposed model must 
also be able to explain the masses, distribution of semi-major axes and 
formation timescales of each of the three sets of regular satellites. Currently 
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there are two main models for this process, that of Canup & Ward (2006) 



and Mosqueira & Estrada ( 2003a|b ), presented in the literature. These two 
models, along withe their associated moon formation predictions will be 
discussed in turn. 



Canup k, Ward (2006) use a time-dependant, single component circum- 



planetary disk model to investigate regular satellite formation. They suggest 
that the properties, in particular, the mass, of regular satellites within this 
disk is determined by the balance between the rate of accretion of material 
onto the protomoons, and orbital decay of these protomoons within the ac- 
cretion disk onto the growing gas giant. This process results in an ordered 
set of approximately 4 large moons within 60 planetary radii of the planet, 
with total mass approximately one ten thousandth of their host planet. This 
model addresses the issue of formation timescales by proposing that undif- 
ferentiated moons e.g. Callisto, started forming later than their comrades, 



and remained undifferentiated as a result of cooler disk conditions (Barr & 



Canup, 2008). 



In comparison, |Mosqueira fc Estrada| f2003a[bl ) propose a two-component 
disk model with a dense inner sub-disk surrounded by a less dense outer disk. 
The edge of this inner disk was set at the centrifugal radius, R c , the radius 
of the orbit of a gas parcel around the planet, such that the gravitational 
force from the planet and centrifugal force on that parcel balance, defined 
as 



Rc. 



J 



■2 



GM- U 



(3.1) 



where j is the angular momemtum of the gas parcel, G is the universal 
gravitational constant and M p is the mass of the planet. This model was 
in part proposed to explain the much longer formation timescale measured 



for Callisto (Anderson et al. , 1998) than its neighbour Ganymede (Schubert 



et al. , 1996). Unlike the model of Canup & Ward (2006), this model predicts 



that the migration timescale of moons is much longer than their formation 
timescale, mainly as a result of gap opening. While the model qualitatively 
describes the ratio of moon mass to planet mass, it does not provide a firm 
limit, however it does predicts that at most one large satellite should be able 
to form outside the centrifugal radius. 3 



' Mosqueira & Estrada ( 2003a I use an analytic approximation for the centrifugal radius 
(R c w 7?if/48), which was derived for the case of distant gas giants. More accurate 



approximations based on simulations are also available (Machida et al. 2008 Machida 
|2009[ ), which are also derived for the case of distant gas giants. However, for the case 
of planets which are close to their parent star, where the planet may take up a non- 
negligible fraction of the centrifugal radius, it is unclear whether these expressions are 
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3.3.3 Captured moons 



Planets can also obtain large satellites through tidal capture, e.g. Neptune's 
moon Triton. Currently, 4 the only model that can reliably produce large 



captured moons is the tidal capture model of Agnor & Hamilton (2006). 



Consequently we will discuss this model in the context of producing large 
moons of extrasolar planets. 



Agnor & Hamilton (2006) suggest that if a binary system (similar to 



Pluto-Charon) passed sufficiently close to a host planet, the orbits of the 
binary pair could be perturbed enough such that one member of the binary 
gained energy and was ejected while the other lost energy and remained 
orbiting the planet. In particular, the binary is likely to be disrupted if a&, 
the semi-major axis of the binary is approximately equal to its Hill radius, 
that is, 

f M 1 + M 2 \ 1/3 
a b = rtd I — ) , (3.2) 



3M„ 



where rtd is distance of closest approach and where M\ and M 2 are the 
masses of the two components in the binary respectively. Also, while Agnor 



& Hamilton (2006) found that it was possible for either component to be 



captured, they found that there was a preference for capturing the lowest 
mass component, and for this captured moon to be in a retrograde orbit. 

Once this capture has taken place, the new moon will be on a highly 
elliptical, probably retrograde orbit, with pericenter distance approximately 
equal to rtd- As this moon crosses the region where regular satellites are 
likely to have formed, either they, or the new moon are likely to be destroyed 
or ejected. Consequently, if a planet has a large captured moon, it should 
be the only large moon. The new moon's orbit will then tidally circularise, 
such that a m ~ 2r^. We can use this to work out the orbital elements of 
the new moon in terms of the orbital elements of the original binary. To 
maximise captured moon mass, we assume that the primordial binary had 
two equal mass components (M m = Mi = M 2 ). Using equation (3.2) to 



still valid. Consequently, for the case of close in gas giant planets, the location of large 
moons may still be determined by the centrifugal radius, except that the position of R c 
for such planets is currently unknown. For this thesis we use the approximate formula, 
R c ~ Rh / '48 (Cassen & Pettibone 1976 Stevenson et al. 19861, as this gives the largest 



centrifugal radius for close in planets. 

4 During the final stages of preparation of this thesis Podsiadlowski et al. (20101 pro- 



posed a new tidal capture model. This model is capable of producing gas giant-gas giant 
binary planets separated from each other by a couple of solar radii. 
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substitute for r t d gives 



,2M, 

or 



2* Ui/" > (3-3) 



a? 



M m = 12M P ^. (3.4) 



As can be seen from equation (3.4), the maximum mass of a tidally captured 
moon is inversely proportional to the cube of the final semi- major axis of 
that moon. Also, if the rotation of the planet is prograde (a likely con- 
sequence of planet formation) and the captured moon's orbit is retrograde 
(a likely consequence of the capture process), the moon will tidally evolve 
inwards towards its host planet, reducing its semi-major axis still further. 
Consequently this formation mechanism is only capable of producing large 
moons close to their host planet. 

In addition, the efficacy of this moon formation channel also depends 
on the population of objects, particularly binary objects, available to be 
captured. As the formation models proposed for binary trans-neptunian 
objects require that both objects have large Hill spheres, that is, they are 



distant from their host star (e.g. Goldreich et al. 2002), it follows that a 



sufficiently large population of large binary objects can only form far from 
their host star. Consequently, the only planets capable of capturing such 
moons must also be distant (so, it is no coincidence that Neptune, the most 
distant of the gas giants is the only gas giant to host a large captured moon). 

As the two detection methods investigated in this thesis require that the 
moon be distant from its host planet and massive (pulsar timing) or orbit a 
planet close to its host star and be large (transit technique), this formation 
method does not seem a promising way of producing large detectable moons. 
Consequently, within the context of this thesis, the issue of captured moons 
will not be focussed on. 



3.4 Stability constraints 

The properties, most particularly the orbital properties of the moons of a 
given planet depend not only on how and where they formed, but also on 



their subsequent evolution. As first pointed out by Barnes & O'Brien (2002), 
this evolution is governed by two main factors, the slow secular change of the 
moon's orbital parameters resulting from orbital perturbation, and the more 
rapid irreversible loss of moons due to tidal disruption, planetary impact or 
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three body instability. These factors will be discussed in turn and then 
combined to produce a summary of the mass and distance limits presented 
in the literature. 



3.4.1 Moon orbital evolution 

The main method presented in the literature by which a moon's orbit slowly 
evolves is though the perturbation on the moon's orbit caused by the tidal 
bulge raised on the planet by the moon. For the case where angular veloc- 
ity associated with the planet's rotation and the angular velocity associated 
with the moon's orbit are equal, the tidal bulge induced on the planet by the 
moon is symmetric across the line joining the planet and moon, and no angu- 
lar momentum is exchanged between the rotation of the planet and the orbit 
of the moon. However, for the case where the planet rotates faster/slower 
than the moon orbits it (i.e. the planet is not tidally locked to the moon), 
the planet's tidal bulge is "dragged" ahead of/behind the line joining the 
planet and moon by an angle 5, which is defined in terms of the tidal dis- 
sipation parameter Q p , via tan(2<5) = 1/Q p . The gravitational interaction 
between this asymmetric bulge and the moon allows an exchange of angular 
momentum between the planet's rotation and the moon's orbit, leading to a 
concomitant increase or decrease in semi-major axis depending on whether 
the planet is rotating faster or slower than the moon. In particular, from 



Murray & Dermott (1999 p. 164), we have that the torque on the moon 



due to the tidal bulge of the planet is given by 

3 k 2p GMjRl 

T p -m = 7y — 7\ — 6 sgn(n Piro4 - n m ) (3.5) 

where G is the universal gravitational constant, k 2p is the tidal Love number 
of the planet, and n p ^ ro t and n m are the angular velocities associated with 
the planet's rotation and the moon's orbit respectively. In addition, we note 
that that sgn, the signum function returns 1 if its argument is positive, if 
its argument is zero, and -1 if its argument is negative. 



Following Barnes & O'Brien (2002), an equation for the evolution of a r 



can be determined as a function of the physical parameters of the planet 
and moon by noting that the torque on the moon is equal to the change in 
angular momentum, that is 

r p _ m = — (a 2 m M m n m ) . (3.6) 
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From Kepler's law we have that n^a^ 



GM p . Combining this expression 
with equations (3.5) and (3.6) and simplifying gives 



da m n k2 P GM m Rp 

—rr = 3 ; TT7^sgn(n^ rot -n m ). 

dt ^GW p Q p a% /2 



(3.7) 



Assuming that n P:rot — n m does not change sign over the course of the 
evolution, 5 this equation can be integrated to give an explicit form for a m as 
a function of time. For outward migrating moons {jip^rot ^m)? ^® obtain 



a m (t) 



a m (t ) 13/2 



(* - 1 ) 



33 k 2v GM m Rl 



2 jGM p Q p J 
while for inward migrating moons (n p ^ ro t < n m ) we obtain 

/ \ / ^ .13/2 fJ . x \ 33 ^2pGM m R p \ ^ 



(3.8) 



(3.9) 



where t is the current time and to is the time at which the moon formed. 
This equation governs the evolution of the moon's semi-major axis up until 
the moment it is destroyed or lost to the planet. 



3.4.2 Processes resulting in moon loss 

Planets can loose moons through two main processes. First, if moons are 
too close to their host planet, they will be destroyed, either through tidal 
disruption or by impacting with the planet's surface. Second, if the moon is 
too distant, it's orbit may become unbound from the planet as a result of the 
effect of the periodic perturbation on the moon's orbit by the gravitational 
field of the host star. These two processes will be discussed in more detail 
in turn. 

5 This is a reasonable assumption assuming that the rotation rate of the planet does not 
change over the lifetime of the system. If a moon is migrating outward, it is because the 
rotational period of the planet is shorter than the orbital period of the moon. As outward 
migration only increases the orbital period of the moon, the moon will continue to migrate 
outwards. Conversely, if a moon is migrating inward, it is because the rotational period 
of the planet is longer than the orbital period of the moon. Again, as inward migration 
results in a decrease in the orbital period of the moon, the moon will continue to migrate 
inward. However, the rotation rate of the planet may change if the moon is large enough 
to modify it, or if the rotation of the planet is influenced by an external factor, for example 
torque from the host star on the tidal bulges raised on the planet by the host star. We do 
not deal with this case in this thesis. 
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The most dramatic way in which moons can be lost to a host planet 
is through tidal disruption or impact with that host planet. To begin, we 
consider the distance from a planet at which a body held together by self 
gravity (i.e. a rubble pile) will disrupt due to tidal forces resulting from the 
planet's non- uniform gravitational field. This distance is called the Roche 
limit and is given by 



Rr — Rp 2 



Pm 



1/3 



(3.10) 



where R p is the radius of the planet and p p and p m are the densities of the 
planet and moon respectively. Depending on the ratio of densities of the 
planet and moon, this radius can range from a couple of planetary radii to 
within the planet. In addition, the moon will impact with the planet when 
the moon's orbit intersects with the surface of the planet. For the case of 
circular orbits, 6 this corresponds to a m Rp. 

In addition to moon destruction, moons can also be lost from a planet 
via three-body instability if the semi-major axis of the moon becomes too 
large. The motion of three bodies under their mutual gravitational fields is 
not a simple issue and can result in a range of complex behaviour. In partic- 
ular, the boundary between stable and unstable orbits is complicated, and 



most likely fractal. Fortunately, simple analytic (e.g. Mardlingj 2008b) and 



numerical (e.g. Barnes & O'Brien 2002; Domingos et al.[ |2006[ ) approxima- 
tions for this boundary are available. These approximations can generally 
be expressed in terms of the Rh, planetary Hill radius, the distance from 
the planet where the planet's gravitational force and the tidal force from the 
star are equal, which is given by 



Rh 



M p 
3M, 



1/3 



(3.11) 



where a p is the semi-major axis of the planet's orbit, M p is the mass of the 
planet and M s is the mass of the star. 7 In particular, the approximations 



used by|Barnes & O'Brien|(2002) and Domingos et al. (2006) in their studies 

a m ,max = 0.36Rh, (3.12) 



of moon stability were 



There are dynamical reasons why close in moons should be in circular orbits around 
their host. To see why, note that the timescale for the decay of the moon's orbital ec- 
centricity, T e , is proportional to a^,, resulting in a very rapid decay in orbital eccentricity 
for moons with small semi-major axes. In addition, as the timescale for the decay of the 
semi-major axis is equal to T e /e^, the semi-major axis always decays on a timescale longer 
than th at of the eccentricity , givin g moons a chance to circularise their orbits. 
7 See Murray & Dermott (1999 p. 116) for a derivation. 
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and 

a m ,max = 0.4895^(1.0000 - 1.0305e p - 0.2738e m ), 
for the case of prograde satellites, and 



and 



0.50R H , 



0.9309^(1.0000 - 1.0764e p - 0.9812e r 



(3.13) 



(3.14) 



(3.15) 



for the case of retrograde satellites respectively, where a m ^ max is the limiting 
semi-major axis of the moon's orbit and where e p and e m are the eccen- 
tricities of the orbit of the planet and moon respectively. As the effect of 



planetary eccentricity on moon detection will be investigated in chapter 10 



the approximation of Domingos et al. ( 2006 ) will be used in this thesis 



3.4.3 Limits on moon mass 



The information presented in section 3.4.1, on semi-major axis evolution, 



and the information presented in section 3.4.2 on the region of parameter 



space where moons are retained can now be combined to provide a lower 
mass limit on moons as a function of its initial semi-major axis using the 



method pioneered by Barnes & O'Brien (2002). Again, as the properties, 



most particularly the mass and the Q p value, of terrestrial planets and gas 
giants differ by a number of orders of magnitude, mass limits for these two 
cases will be discussed separately. 



3.4.3.1 Mass limits for moons of terrestrial planets 



As described in section 3.3.1 impact generated moons of terrestrial planets 



form a few planetary radii from their host, on a circular orbit which may 
be arbitrarily inclined with respect to the plane of the planetary orbit. In 
addition, they may contain a sizable fraction of their host planet's mass. 
While the moon is formed at this position, it will not remain there long. 



If the moon's orbit is highly inclined, simulations (Atobe Sz Ida 2007) show 



that while the moon's orbit initially evolves outward, the direction of evolu- 
tion will reverse, resulting in either the moon being deposited a few planetary 
radii from the planet or the moon reimpacting with the planet. In addition, 
if the spin axis of the planet is anti-aligned with that of the orbit of the moon, 
the moon's orbit will shrink, again leading to re-impact. Alternatively, if the 
moon's orbit is aligned with the planet's orbit, it will survive and rapidly 
migrate outwards as a result of the low Q p value (and consequent high rate 
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of tidal energy dissipation) of terrestrial planets. Thus, large, distant, moons 
of terrestrial planets, will have orbits which are roughly coplanar with the 
orbit of their host planet. 

For the case where the rotational evolution of the planet is not domi- 
nated by the moon (that is, the moon is small or the rotational evolution 



of the planet is dominated by stellar tides) we can use equation (3.8) to 



describe semi-major axis evolution as the sign of the (n Piro t — n m ) term in 



equation (3.7) will remain constant. From equation (3.8) we have that the 



location of the moon after a period of time, T, is given by 

2/13 

-) 

2 y/GMpQp 



i3/ 2 | T 33 k 2p GM m R» 



However, as impact generated moons can contain a sizable fraction of 
their host planet's mass, they can, and do noticeably modify their host 
planet's rotation rate as they evolve. For the case where the rotational 
evolution of the planet is dominated by the moon, the planet and moon 
may tidally lock to the planet (that is, the moon completes one full orbit 



per planetary revolution). In this case the sgn term in equation (3.7) is equal 
to zero, and consequently, the semi-major axis of the moon's orbit will no 
longer evolve as a result of this mechanism. Consequently, distant terrestrial 
planets may retain large moons that they otherwise would have lost. For 
example, a system in which tidal locking has occurred is the Pluto-Charon 
system. 

3.4.3.2 Mass limits for moons of gas giants 

In contrast to the satellites of terrestrial planets, regular satellites of gas 
giants do not form at a location, they form within a region. In addition, 
compared to the mass of their host planets, the regular satellites of gas 
giants are very small, and consequently unable to modify their host's rota- 



tion. Consequently, we can use equations (3.8) and (3.9), but, as a result of 
the range of initial formation locations, need to consider both inward and 
outward orbital evolution. 

For the case of terrestrial planets, we were able to predict the location of 
an impact-generated satellite after a given period of time, as a function of 
its mass. However, as moons of gas giants form within a region, we cannot 
perform a similar extrapolation for the case of moons of gas giants. However, 
we can ask what satellites (in particular what mass satellites) will still be 
extant, albeit in a different position, after a given period of time. 
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To answer this question we consider equations (3.8) and (3.9) the equa- 
tions governing a m for the case of outwardly and inwardly evolving moons. 
For the outward evolution case, the moon is lost to the system when its orbit 



becomes three body unstable, that is when a r , 

Q"m,max 



> a. 



m,max' 



Setting a m (t) to 



and rearranging equation (3.8) to get M m gives 



M m < 



2 

33 



13/2 
l m,max 



i(*0) 



13/2 



\ y/GM~ P Q P 



J Th 



2p 



GRl 



(3.17) 



for the case of outwardly evolving moons. For the inward evolution case, 
the moon is lost to the system when in impacts with the planet or when it 
is tidally disrupted. And as tidal evolution occurs rapidly for a m « R p (r oc 
(R p /a m ) 5 ), Rr and R p can be considered equivalent with respect to deriving 



moon mass limits. Setting a m (t) to R p and rearranging to equation (3.9) to 
get M m gives 



M m < 



33 



1 13/2 



13/2 \ s/GMpQp 
p ) Tk 2 pC r >~ ' 



,GRl 



(3.18) 



for the case of inwardly evolving moons. 



Physically, equations (3.17) and (3.18) place limits on the maximum 



mass moon that can exist around a given gas giant. For example, using the 
stability criterion of Barnes & O'Brien (2002) for a Jupiter- like planet 8 on a 
circular orbit with a p = 0.2AU, we have that extant moons must have mass 
less than 0.88M^. This limit rises to 6.6Afm for the case where the more 



generous stability criterion of Domingos et al. (2006) is used. 



3.5 Summary of literature moon limits 

Informed by the census of the large moons present in the solar system, the 
formation mechanisms and stability properties of large moons have been dis- 
cussed. As discussed at the beginning of this chapter the aim of this analysis 
is to provide a context in which to view moon detection thresholds. Conse- 
quently, the limits for terrestrial and gas giant planets will be summarised 
with this in mind. 



3.5.1 Limits for moons of terrestrial planets 

Constraints can be placed on the physical and orbital characteristics of 
impact-generated moons as a result of limits imposed by the physics of 

8 M P = 1.8986 x 10 2 7kg, R p = 71398km, k 2p = 0.51 and Q p = 10 5 . 



42 



the formation process, moon orbital evolution and stability. As discussed in 



section 3.3.1 impact generated moons form from the disk of debris resulting 
from a collision between an planet-sized impactor and a planet. For Earth- 
like terrestrial planets, the impact process is capable of lifting a maximum 
of 4% of the total mass contained within of the planet and impactor into 
orbit, while for larger mass planets, shocks may form in the debris field, re- 
sulting in most of the mass being ejected. Consequently, the mass available 
to form a moon is limited to at most 4% of the total mass contained within 
the two bodies for Earth-sized terrestrial planets, and substantially less for 
larger planets. As a result of post-impact interactions, this disk coalesces 
into a single moon. While this moon can be formed with any initial orbit 
orientation, only moons with orbits that are approximately aligned with 
the planetary orbit will survive and undergo outward orbital evolution. In 
particular, the semi-major axis of the moon will evolve in accordance with 



equation (3.16). To summarise these results, these limits on moon mass 



and semi-major axis are shown in figure 3.3 for the case of a Earth-like host 
planet. 

3.5.2 Limits for moons of gas giants 

Similarly, an understanding of the processes related to moon formation and 
stability can constrain the set of moons of gas giants that are predicted to 
form. As discussed in section [3 .3. 2 it is believed that the regular satellites of 



gas giants formed within a circumplanetary disk, resulting in a set of moons 
with orbits which are nearly exactly aligned with the equatorial plane of 
their host planet. As the equator of a gas giant should show a tendency to 
be aligned with the orbital plane, this corresponds to a tendency for the orbit 
of moons of gas giants to also be aligned with the orbit of their host planet. 
In addition, models for this process suggest that the total mass contained 
within large moons of a gas giant should be approximately 2.5 x 10 -4 of 
that gas giant's mass, and the majority of large moons of gas giants should 
form within a certain distance (be it 60 R p or R c ) of their host planet. Once 



formed, the moons will migrate according to equation (3.8) or (3.9), and 



consequently, the maximum mass of a moon formed at a semi-major axis of 



i{to) and still retained within the system is given by equations (3.17) and 



(3.18) for the case of outward and intward migration respectively. These 



limits are summarised for the case of a Jupiter-like 9 planet in figure 3.4 



9 The term Jupiter-like is a little misleading as some of the properties of Jupiter, in 
particular, its Q- value, are not well constrained. Recent result suggest that the Q- value of a 
gas giant may depend on the forcing frequency, in this case, on the orbital frequency of the 
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Figure 3.3: Plot showing the constraints placed on moons of an Earth- 
analog terrestrial planet (M p = 5.9736 x 10 24 kg, R p = 6378km, k 2p = 0.299, 
Q p = 12) as a result of their formation and consequent orbital evolution. 
The regions where the moons can never exist (inside the planet and outside 
the stability region) are cross hatched. The upper mass limit is denoted by 
a dashed line. Finally, for comparison, the semi-major axis an outwardly 
migrating moon of given mass would evolve to is shown by a blue line, for 
the case of a system age, T, of 5 Gyr. 
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Figure 3.4: Plot showing the constraints placed on moons of a Jupiter-analog 
gas giant as a result of their formation and consequent orbital evolution. The 
regions where the moons can never exist (inside the planet and outside the 
stability region) are cross hatched. The mass limit proposed by Canup & 



Ward (2006) and the radius limit proposed by Mosqueira & Estrada (2003a) 



are denoted by dashed lines, and labeled. Finally, for comparison, the set of 
moons that, once formed would survive inward and outward migration are 
denoted by blue and red regions respectively, for the case of a system age, 
T, of 5 Gyr. 
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3.6 Conclusion 



By considering the census of moons within the Solar System along with the 
current understandings of moon formation and orbital evolution, we have 
summarised the types of large moons that we expect extra solar planets to 
host. Terrestrial planets are expected to host a single large moon (if any) 
containing up to 4% of the planet's mass. In addition, the semi-major axis 
of this moon will be defined by the system lifetime and the orbital evolution 
timescale of the moon. In comparison, gas giant planets are expected each 
to possess a small number of large moons, where the total mass contained 
within these moons is approximately equal to 2.5 x 10 -4 M p . In addition, 
these moons should form relatively close to the planet (either R c or 60R P ) 
and then slowly evolve inward or outward depending on whether the rota- 
tional period of the planet is longer or shorter than the orbital period of the 
moon. From this context we can start to look at the set of proposed moon 
detection mechanisms. 



moon . While Jupiter's Q-value ha s been measured to be 10 s — 10 6 by (Goldreich & Soter 
1966| ), and (3.56 ±0.66) x 1 4 by dLainey et al ||2009||, there are good theoretical reasons 
for it to be as high a s 10 12 |Goldreich k, Nicholson 



work (Ogilvie & Lin 



2004 



1977 



Wu 



2005). Recent theoretical 

12\ 



Wu| |2005| suggests that the naturally high (10 12 ) Q-value of 
gas giants could be suppressed (10 s — 10 9 ) for the case where the forcing frequency is less 
than twice the spin frequency of the planet, neatly explaining both results. For this work 
we use a constant Q-value of 10 5 , but note that the adoption of a Q-value of 10 12 allows 



moons of even hot Jupiters to be dynamically stable ( Cassidy et al. 2009 1 
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Chapter 4 



Review of planet and moon 
detection techniques 

4.1 Introduction 

To provide a context for the two moon detection methods analysed in this 
thesis, it would be instructive to review the set of moon detection meth- 
ods presented in the literature, and the types of moons they are capable 
of detecting. As moon detection techniques are in part inspired by planet 
detection techniques, and a planet must be detected using one of these tech- 
niques before a moon of that planet can be detected, we will begin our 
discussion with a brief overview of planet detection. In particular, we will 
look at planet detection techniques in terms of the physics they are based 
on, how effective they are at finding planets, and the types of planets likely 
to be discovered using them. Building on this framework, moon detection 
will then be discussed. In particular it will be discussed with respect to 
physics of the detection method, the type of host planet required, and the 
types of moons that can be discovered. 

4.2 Planet detection techniques 

Currently, over 500 extrasolar planets have been discovered. 1 These dis- 
coveries have been made using a variety of methods, including, the radial 
velocity, transit, microlensing, timing and direct imaging techniques. In ad- 
dition, other techniques, such as astrometry, have been proposed, but have 

1 See e.g http://exoplanet.eu/catalogue.php 
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Figure 4.1: Plot of all known extrasolar planets (small dots) as of the 5 th of 
April 2010, colour-coded by detection method. For the cases where M p s'ml p 
is known, while M p is not, for example, the planets detected by radial ve- 
locity, M p sin I p has been used in place of M p , as it indicates the lowest 
possible value of M p . For comparison, the planets of our Solar System are 
also shown. 



yet resulted in no successful detections of planets. To provide a context 
for the upcoming discussion on moon detection, these techniques will be 
described in terms of the physical basis for the detection technique, and 
the type and number of planets detected. For reference and comparison, a 
diagram showing all the planets as of 5 th of April 2010 is given in figure 4.1 



We will begin with the radial velocity technique, the most successful planet 
detection technique to date. 
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4.2.1 The radial velocity technique 



The radial velocity technique is a planetary detection technique which in- 
volves measuring the reflex "wobble" of the planet's parent star due to the 
planet's motion. Practically, this is done by measuring the periodic shift of 
the absorption lines in the star's spectrum and converting this to a line-of- 
sight velocity. The amplitude of this velocity for a planet on a circular orbit 
is given by 



. M„ GM S . . 

max{v p j os ) = -f* sm/p, (4.1) 



M s y a p 



Mpsinip /M w \ 1/2 /IAEA 1/2 , , , 

28.4 p - p ^ ms" 1 , 4.2 

Mj \M S \ a v y ' 



where M p and M s are the mass of the planet and star respectively, and, a p 
and I p are the semi-major axis and line-of-sight inclination of the planet's 



orbit. As can be seen in equation (4.2) and figure 4.1 this technique is more 



sensitive to planets in close orbits, but is still capable of detecting more 
distant planets. While this technique is capable of detecting planets with 
any orbital inclination, it does have the drawback of only yielding M p sin I p 
where M p is the true mass of the planet and I p is the inclination of the 
orbit. Despite requiring high resolution, high signal-to-noise spectra that 
are expensive in terms of telescope time, it is currently the most successful 
planetary detection technique, with nearly 500 planets detected. 



4.2.2 The transit technique 

For the case where a planet's orbit is aligned such that it transits across the 
face of its host star as viewed from the Earth, the presence of the planet can 
be inferred by the corresponding dip of the intensity of its host star during 
this passage. Such a transit will have a depth of approximately (R p / B? s )Lq, 2 
where Rp and R s are the radius of the planet and star respectively, and Lq 
is the unoccluded luminosity, and for a planet on a circular orbit, can last 



2 Note that Rp/R^ is equal to the ratio of the projected area of the planet to that of 
the star. 
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depending on the planet's orbital inclination. As planets can only be de- 
tected if they transit (probability of transiting is oc a~ 1 ) and during transit 

(time between transits is proportional to a p ), this technique is strongly 



biased toward detecting short period planets (see figure 4.1). While not all 
planets will be detected, this technique is photometric as opposed to spec- 
troscopic, which means that each star in a crowded field can be monitored 
simultaneously, so many more stars (and many fainter stars) can be inves- 
tigated for planets. So far, 115 transiting planets are known. 3 This method 



is discussed in greater detail in section 7.2 



4.2.3 Microlensing 

Planets can also be detected by the perturbations caused by their gravi- 
tational field. To see how, consider a relatively nearby star (the lens star) 
moving against a background of more distant stars. If the projected distance 
on the sky between this lens star and a background star (the source star) 
becomes small enough, the gravitational field of the lens star can perturb 
the path of photons leading to image the image of the source star being mag- 
nified or demagnified. In addition, if this lens star has a planet, the planet 
can lead to additional spikes in the light curve of the source star. This tech- 
nique is sensitive to planets with a projected distance from their host star of 
approximately one stellar Einstein radius, where the stellar Einstein radius 
is defined as 

R E = D L E , (4.5) 



3 The number of planets discovered using the transit technique is not an easily definable 
quantity. This is partially because, in order to be confirmed, a planet detected via the 
transit technique must also be detected using radial velocity. In addition, planets initially 
detected using radial velocity may later be discovered to transit. 
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where 9e is the angular einstein radius, G is the universal gravitational 
constant, and Dl and D$ are the distances between the observer and the 
lens and the observer and the source respectively, and where the scale dis- 
tance for Dg of 8kpc (the Milky Way bulge) could just have equally been 
50kpc (Large Magellanic Cloud), or 60kpc (Small Magellanic Cloud). This 
distance ranges from 1AU to 5AU for typical microlensing systems, which 
corresponds well with the 1AU to 5AU band in which microlensing planets 
have been discovered (see figure 4.1). Currently, twelve planets, in eleven 
systems have been detected using this technique. 



4.2.4 Timing 

The timing technique involves measuring perturbations in the arrival time of 
periodic events associated with a host star, resulting from the reflex motion 
caused by orbiting planets. Example periodic events include radio pulses 
associated with millisecond pulsars, pulsations associated with giant stars 
or with white dwarf stars and eclipses of binary stars. The timing amplitude 
for a planet in a circular orbit about a host with mass M s is given by 



max(t 



pert ) 



IM P 

-— a p sm/ p , 

M p sml p a p M c 
Mj 1AU mT S ' 



0, 



(4.8) 
(4.9) 



where M p and M s are the mass of the planet and host, a p is the semi-major 
axis of the planet's orbit and c is the speed of light. Similar to the case for 
the radial velocity technique, the planetary mass appears only in the term 
M p sin I p . Consequently, only M p s'ml p can be directly measured. However, 
for the case of millisecond pulsar host stars, this technique is so sensitive, 
that second order effects, such as resonance effects, can be used to define 
limits to the orbital inclination and consequently provide an estimate of the 
planet's mass (Konacki & Wolszczan, 2003). Currently ten planets have 



been discovered using this technique, four in two millisecond pulsar systems 



(Wolszczan & Frail 1992; Wolszczan, 1994 Backer et al. 1993), one around 



a pulsating horizontal branch star (Silvotti et al. , 2007) and and five planets 



in three circumbinary systems (Lee et al. 2009 Qian et al. 2009 2010). 



This technique is discussed further with respect to millisecond pulsars, 
Part HQ 



m 
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4.2.5 Direct imaging 



As its name suggests, the technique of direct imaging involves detecting 
planets through an image. Unfortunately, making an image of an extra- 
solar planet is challenging for two main reasons. First, host stars are many 
orders of magnitude brighter than their attendant planets (10 9 for the case of 
a Jupiter analog around a Sun-like star). Second, the planets are very close, 
generally in the wings of the stellar point-spread function. Consequently, 
special techniques such as coronography or adaptive optics need to be em- 
ployed to implement this method, and even then, it is most sensitive to hot 



distant planets. While distant hot planets are uncommon (Nielsen et al. 



2008), they do exist. Currently, using this technique, seven planets have 



been detected around four stars. 4 Despite the technical challenges imposed 
by this method, it can potentially offer high returns in terms of planetary 
followup, for example, detection of the planetary spectrum. As a result, the 
capability to directly detect extra-solar planets is a strong science goal in a 
number of proposals for the next generation of telescopes, for example, the 
Terrestrial Planet Finder (TPF) and the Giant Magellan Telescope (GMT). 



4.2.6 Astrometry 

Astrometry is a planet detection technique that involves measuring the mo- 
tion of a star about the planet moon barycenter via the perturbation of the 
star's position on the plane of the sky. Unlike the radial velocity technique, 
this technique can reveal a planet's true orbital inclination and, combined 
with an estimate of the star's mass, the planet's true mass. For circular 
orbits, a, the maximum amplitude of such an angular displacement is given 
by 



a 



M p a p 

WIT) 



0.3- 



M Q M p 



a,, 



lpc 



M, M ffi 1AU D 



-//as, 



(4.10) 



where M p is the mass of the planet, M s is the mass of the star, a p is the semi- 
major axis of the planet's orbit and D is the distance between the host star 
and the Earth. As the amplitude of the signal is inversely proportional to D, 
and proportional to a p , this method is optimised to detect distant planets 
of nearby low-mass stars. As of the 24 th of March 2010, no known extra- 



solar planets has also been successfully detected astrometrically (Pravdo k, 



Shaklan 


2009 


Bean et al. 


2010) 



However, future space missions, most 



4 While other candidate planets have been detected, the errors in their masses are so 
large that their planetary status is uncertain, or they are orbiting brown dwarfs. 
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notably SIM, 5 are planned to use this technique to find low mass distant 
planets. 



4.3 Moon detection techniques 

To date, no extra-solar moons have been discovered, with only upper limits 



placed on moon radii and mass of the planets orbiting HD 209458 (Brown 



et al. 2001) and OGLE-TR-113b (Gillon et al. 2006). Despite this, a num 



ber of investigations have been conducted into possible methods for detect- 
ing extrasolar moons. As most of these methods are extensions of planet 
detection techniques, the corresponding moon detection techniques will be 
summarised in the same order used in the preceding section. In addition, 
the properties of these detectable extra-solar moons are summarised in fig- 
ure |4.2[ according to the type of host planet, that is, the type of method 



used to detect the host, and the moon detection technique used. 
4.3.1 Radial velocity perturbation 

As the majority of planets have been discovered using the radial velocity 
technique and approximately half of these planets are distant enough from 
their stars to host sizable moons (see section 3.4.3), it seems intuitively 



sensible to try and extend the radial velocity technique to search for moons 
of these radial velocity planets. While the possibility of using this technique 



has been suggested in the literature (e.g. Szabo et al. , 2006), it was qualified 



by the statement that success would be "unlikely" due to the small signals 
produced. 

Currently, no analysis of the radial velocity perturbation specific to 
planet-moon systems has been performed. However, analogous systems have 



been studied. Schneider & Cabrera (2006) studied the radial velocity per 



turbation caused by a pair of equal mass binary stars on a companion and 
found the amplitude of the perturbation to be 



max(v pert ) 



IGM A a% 



(4.11) 



where G is the universal gravitational constant, Ma is the mass of the com- 
panion star, cla is the distance between the companion star and the center 
of mass of the binary, and is the distance between one of the binary stars 



5 See for example, http://sim.jpl.nasa.gov/keyPubPapers/SIMLiteBook/SIM-Book- 
Full-Book-LR.pdf. 
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Figure 4.2: A diagram using the same scale and colour scheme as figure 4.1 



summarising the types of moons which can be detected as a function of 
planetary detection method. Information on the type of moons which can be 
detected, is provided inside the cartoon representation of the region in which 
planets have currently been discovered using a given method. For the case of 
planets detected by timing, no cartoon was included as the detectability of 
a given planet depends more on the properties of its host that its properties. 
Again, for comparison, the Solar System planets are overlaid. 



and the center of mass of the binary pair. Designating one of the binary 
pair to be the "planet", and the other the "moon", and setting a m = 2Ab, 
then setting the mass of the companion to the mass of the host star and 



'Schneider & Cabrera (2006) defined as to be the distance between the center of mass 
of the binary pair and one of the components. For this thesis a m was defined to be the 
semi-major axis of the orbit of the moon relative to the planet. From the perspective of 
the "planet" star, the "moon" star orbits at a distance of 2as (as to get from the "planet" 
star to the center of mass, and as to get tot he "moon" star). Consequently a m = 2as- 
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= a a, equation (4.11) becomes 

9 IgM s aL , A 
max( Vrt ) = ? ^^, (4-12) 

-k^) www- 1 - <-> 



From equation (4.13) we have that the perturbation is maximised for 
planet moon pairs that are massive, distant from each other, but close to 
their host star. For such large moons, the minimum size of a p is limited by 



the timescale for orbital decay to approximately 0.6AU (Barnes & O'Brien 



2002 ) , and the maximum size of a m is limited by three body stability to 



0.5Rh for prograde orbits and ~ Rh for retrograde orbits (see section 3.4.2 ). 
Setting the mass of the host star to O.IMq, setting a p = 0.6 AU, setting 
a m = Rh and setting the mass of the planet and moon to 5Mj gives a 
perturbation signal of amplitude 3.7ms -1 . As that these are very extreme 
conditions, and radial velocity measurements are currently limited to a little 
below lms -1 due to stellar surface motion, detection is indeed "unlikely". 
Consequently, this method is unlikely to be of practical use. 



4.3.2 Perturbation to transit light curve 

Moons of a transiting planet can alter the light curve in a number of ways, 
each corresponding to different physical processes. Consequently a number 
of different methods, and corresponding test statistics have been proposed 
and investigated in the literature. These include: 

• Direct Detection: Detecting the extra dip in the light curve due to the 
moon. 

• Barycentric Transit Timing: Detecting motion of the planet around 
the planet-moon barycenter through the possible lead or lag in the 
planetary transit mid-time. 

• Transit Duration Variation: Detecting motion of the planet about the 
planet-moon barycenter through variations in planet transit duration. 

• Photometric Transit Timing: A hybrid test statistic which measures 
the distortion of the light curve due to the transit of the moon. 

To highlight the physics underlying each of the methods, formulae will be 
derived, and the corresponding moon detection thresholds for each method 
compared. 



55 



Time 



Figure 4.3: Diagram showing the different portions of the transit light curve 
for the case where both a planet and moon transit. Four silhouettes of the 
planet and moon are shown, corresponding to the the beginning and end of 
planetary ingress, and the beginning and end of planetary egress. Planet- 
moon pairs which correspond to a single silhouette are joined by a solid line, 
while the location of the planet-moon barycenter is indicated by cross. As 
the position of the planet-moon barycenter is a linear function of time it can 
be used as a proxy for time. Consequently the position of the barycenter 
and the value of the light curve resulting from that position are linked by 
dashed lines. 



Unfortunately, as different authors use different underlying assumptions, 
the thresholds cannot simply be copied from the respective papers. In partic- 
ular, the issues of the size and orbital parameters of the moon, the number of 
observed transits, the detection threshold and the type of photometric noise 
will be discussed. For simplicity and ease of comparison it was decided to 
investigate moons on circular orbits which are aligned to the line-of-sight. 7 
In addition, guided by the properties of Solar System moons, it was decided 
to investigate the case where the moon is small compared to the planet (i.e. 



7 For a discussion of why moon orbits which are coplanar to the orbits of their host plan- 



ets and moon orbits which are aligned to the line-of-sight are equivalent, see section 8.2.1 
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Figure 4.4: Approximate moon detection thresholds for the direct detection, 
barycentric transit timing and transit duration variation methods calculated 
using equations (4.18), (4.25) and (4.28). It is assumed that the length of 



time over which observations were recorded was four years, corresponding 
to N = 4 for the case of a p = 1AU and N = 44 for the case of a p = 
0.2AU. For the case of the barycentric transit timing and transit duration 
variation methods, the thresholds for rocky (p m = 5515kgm~ 3 ) and icy 
{Pm = 916.7kgm -3 ) moons are shown in brown and blue respectively. These 
thresholds are shown for four different cases, for a standard (a), and three 
comparison cases (b) - (d). The comparison between (a) and (b) shows 
the effect of changing from a Jupiter-like host planet to an Earth-like host 
planet. The comparison between (a) and (c) shows the effect of reducing the 
semi-major axis of the host planet from 1AU to 0.2AU (and consequently 
reducing the transit duration, but increasing the number of transits). The 
comparison between (a) and (d) shows the effect of decreasing the length of 
the chord the planet makes across the face star (and consequently increasing 
the duration of ingress while decreasing the transit duration). 
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Rr, 



< Rp and M„ 



<C M p ). To simplify the mathematics, it was decided 
to consider the case where N, the number of transits is large (i.e 3> 4). As 
a detection threshold needs to be selected, it was decided to use the three 
sigma (99.7%) detection threshold as this is the threshold that will be used 
in chapter [Toj Finally, while correlated noise can strongly affect planet de- 



tection (Pont et al., 2006), any correlated noise encountered will be specific 



to that system (combination of star, telescope, and observing conditions). 
Consequently, for ease of comparison, the formulae were derived under the 
assumption of normal uncorrelated noise. Within this context, each of the 
four methods introduced, will be discussed and analysed, in turn. 



4.3.2.1 Direct detection of the moon's transit 

Similar to the case for transiting planets, the passage of a moon across 
the face of the star can also block some of the star's light, resulting in an 
additional dip in the light curve. For the case of the planet-moon pair shown 
in figure [473] , the additional dip due to the moon is translated to the right of 
the dip due to the planet. This effect has been investigated in the literature 



in terms of the probability that a given moon will transit (Sartoretti &: 



Schneider, 1999), the depth of the dip produced (Sartoretti & Schneider 



1999) and the effect of mutual events, such as the moon eclipsing or being 



eclipsed by the planet (Sartoretti Sz Schneider, 1999; Cabrera & Schneider 



2007) on the shape of this dip. In addition to theoretical investigations, this 



method has been used to place a limit of 1.2 Earth radii on the radius of 



moons orbiting HD 209458 (Brown et al., 2001). 



By analogy with the planetary detection case, directly detecting a moon 
in a transit light curve involves comparing the average photon deficit due to 
the moon with the error in the average photon deficit due to the photometric 
noise. The degree to which the deficit due to the moon is larger than the 
error due to photometric noise determines whether or not a moon could be 
detected. From geometry we have that the average depth of the dip caused 
by the moon is given by 



l R m/ R2 s) L ' 



(4.14) 



where R m is the radius of the moon, Lq is the unoccluded luminosity of the 
star, and where the effect of mutual events has been ignored. 

Following |Pont et al.| fl2006| ) , we can estimate the error in the deficit due 
to photometric noise {gl per exposure) over the N exp exposures found in the 



Unfortunately, such large moons are not tidally stable (see section 3.4.3 1 so this limit 



is more a statement on the quality of the data than on the size of moons of HD 209458. 
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region of the light curve where the moon is transiting. 9 Assuming that the 
noise in one exposure is normally distributed and independent, the Central 
Limit Theorem gives 

(4.15) 



y N exp 

where a a is the standard deviation of the error in the measured depth and 
ol is the absolute photometric noise. 

To evaluate this expression, we require information about N exp . Con- 
sider, that as a result of orbital motion about the planet moon barycenter, 



the duration of the transit of the planet (e.g. Kipping 2008) and the moon 



will vary from transit to transit. However, as the time-averaged velocity of 
a moon about its host planet is equal to zero (if it wasn't, then it would 
drift away from its host planet), we can approximate the duration of a typ- 
ical moon transit with that of a typical planetary transit. Consequently 
we can approximate N exp as NTt ra / At, where T ra is the duration of the 
planetary transit for the case of no moon, At is the exposure time, and N 
is the number of transits. Thus 



On 



CTL 



V 'NTtra/ AT 



(4.16) 



Recalling that we are using an three sigma detection threshold, we have 
that, in order to be detected, the average depth of the dip due to the moon 
must be three times the average error in this depth, that is 



r>2 
- rt m 

m 



Lr 



0"L 



or 



0.0065R 



ATi/4 



ol/L 







3.95 x 10- 4 



y/ NT tra / At' 



At y/ 2 ' 

lmin / 



1/2 /13hrs\ 1/4 

V Ttra J 



(4.17) 



(4.18) 



While equation (4.18) indicates the detection threshold for the case of 



direct detection, it is not necessarily indicative of the true detection thresh- 



old for moons. For example, equation (4.18) neglects effects due to fitting 



the planetary light curve and the effects of correlated noise. In particular, 



Pont et al. (2006) showed that for planet detection, equation (4.18) became 



increasingly inaccurate as the transit duration and proportion of low fre- 
quency red noise increased. However, as Kepler is billed as being able to 



9 The question of how we know which sections of light curve correspond to when the 
moon is transiting is one that will not be addressed in this work. 
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detect Earth-like planets at 1AU in the presence of realistic photometric 
noise, it seems reasonable that it should be able to detect Earth- like moons 
of planets at 1 AU. 



4.3.2.2 Barycentric transit timing 

Moons of transiting planets can also be detected through the motion of 



their host planet about the planet-moon barycenter (Sartoretti & Schnei 



der, 1999). The technique of barycentric transit timing (TTV^) aims to 



detect this motion through deviations in the transit mid-time from strict 
periodicity (caused by the planet's physical displacement from the planet- 
moon barycenter). For example, in figure 4.3 the transit of the planet occurs 



earlier than would have happened had there been no moon. Using this tech- 
nique, Brown et al. (2001) and Gillon et al. (2006) placed upper limits on 



the masses of moons orbiting HD 209458 and OGLE-TR-113b of 3 Earth 
masses and 7 Earth masses respectively. 



As will be derived in section 10.4, the three sigma detection threshold 
for a sinusoidal signal for the case where N, the number of samples is large, 
is given by 

A 2 

^ <« 9 > 



13.95 



where A is the amplitude of the signal and a is the standard deviation of 
the noise. 



From (Sartoretti & Schneider, 1999) we that the amplitude of the timing 



signal caused by the moon is given by 



TTV h 



M T 
m M p 2ira p ' 



M„ 



M n V GM, 



(4.20) 
(4.21) 



where T p is the period of the planetary orbit, and where we note that Kipping 



(2009a) investigated and extended this expression for the case of eccentric 



moon orbits 
From 



Deeg (2002) we have that <Jt midp , the standard deviation of the 
timing error on this transit mid-time for a single transit is given by 



G7 



mid,p 



/ dL(ti, t m id,p) 



dt 



mid,p 



-1/2 



(4.22) 
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where the sum runs over the length of the planetary transit, where L(ti, t m id tP ) 
is the expression for a transit light curve centered at t m id tP at time U, and 
where it is assumed that only the transit mid-time varies from transit to 
transit. Noticing that the sum is dominated by the regions of the tran- 
sit light curve with the largest gradient, that is, the ingress and egress, 



equation (4.22) can be simplified. Following Carter et al. (2008) we approx- 



imate the transit light curve by three straight line segments, one each for 
the ingress, flat bottom and egress of the transit. 10 We also assume that 
the transit has depth a p and Tj n is the duration of both the ingress and 
egress. Consequently, for both the ingress and egress the square of the par- 
tial derivative in equation (4.22) is given by (a p /Ti n ) 2 . Noting that there 



are Ti n /At exposures during both the ingress and egress, and substituting 



the above approximation into equation (4.22) gives 



2 -^in J 

At \ f~ 



-1/2 



a L y/AtTj, 
y/2 a p 



(4.23) 
(4.24) 



Substituting these expressions into equation (4.19) and simplifying gives 



Rr, 



0.01681L 



ATl/6 



At 



3.95 x 10~ 4 V lmin 



o 



Rs 



1/3 



1/2- 



Pp_ 
Pm 



1/3 



/I3hrs\ 1/6 

V Ttra J 



1/3 



Rp 
OAR, 



1/2 



(4.25) 



where we have used equation (E.17) for T« n and the expressions a p = 
LoRp/R 2 , M m = /) m 4/37ri?^ and M p = p p 4/37rR p , where p m and p p are 
the densities of the moon and planet respectively. As can be seen by the 
dependence of R m on a m and p m , this statistic is good for detecting massive, 



distant moons (see figure 4.4). 



Unfortunately, other physical systems can also cause periodicities in tran- 



sit timing e.g. the presence of additional planet (Miralda-Escude, 2002 Stef- 



fen & Agol , 2005 ; Agol et al. , 2005 ) . In addition, as yet, no analysis has been 



performed on the effect of realistic stellar noise on this statistic. However, 
as the ingress and egress of the transit are relatively short compared to the 
transit duration, it seems intuitively reasonable that this technique would 
be more robust to the effects of red photometric noise than direct detection. 



For a discussion of the effect of neglecting limb darkening, see Carter et al. (20081. 
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4.3.2.3 Transit duration variation 



The transit duration variation technique (TDV) is an alternative way to de- 
tect motion of the planet about the planet-moon barycenter. This involves 
detecting transit to transit variation of the planetary transit duration to 
measure any perturbation in the velocity of the transiting planet as it moves 



around the planet-moon barycenter (Kipping| 2009a|). For example in fig- 
ure 



4.3 



the motion of the planet around the planet-moon barycenter is in 
the opposite direction as the bulk motion of the planet-moon pair, result- 
ing in a longer planetary transit duration than if there had been no moon. 



This method was introduced by Kipping (2009a), and extended by Kipping 



(2009b) to include inclined moon orbits. While it has not been used to set 
limits on moons of currently known planets, it is predicted to be able to 
detect moons as small as O.2M0 in Kepler data (Kipping et al. 



2009) 



From Kipping (2009a) we have that the amplitude of the timing per- 



turbation in the transit duration caused by a planet and moon on circular 
coplanar orbits is given by 



TDV n 




tra- 



(4.26) 



Where w e have transform ed from the root mean squared amplitude pre- 
sented in Kipping ( 2009a ) to an amplitude by multiplying by v2 and where 



it is assumed that M p <C M s . In addition, from Carter et al. (2008) we have 



that the expression for the error in transit duration due to photometric noise 
can be approximated by 



a 



At 



tra 



'- tra 



(4.27) 



f tra 



Using the method described in the previous section, equation (4.26) and 



(4.27) can be combined to give the to give the three sigma detection thresh- 



old, which is 



R r , 



0.0197i?< 



ATl/6 
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At 
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1/6 
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/13hrs\ 1/2 
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1/3 



(4.28) 



where pi ce , the density of ice is taken to be 916.7kgm 3 , and only the highest 
order terms in R p /R s have been retained. As R m decreases as a m decreases, 



this statistic is optimised for detecting close-in moons (see figure 4.4). 
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From the work of Kipping et al. (2009) it seems that this technique is 
relatively robust in terms of correlated stellar noise and instrumental vari- 
ability. They investigated the shape of the distribution of errors in transit 
duration for the case of white and synthetic red noise and found that the 
distributions agreed and were both normal. This result could be again due 
to the fact that the size of the timing error is dominated by the ingress and 
egress, and is not strongly affected by red noise as a result of the short time 
over which red noise has to act. 



4.3.2.4 Photometric transit timing 



The photometric transit timing technique (TTV p ) also uses the timing of 
transits to search for moons, but in this case, the time used is not the 
midpoint of the transit, but r, the first moment of the photon deficit caused 
by the planet-moon pair. Qualitatively, this statistic measures the position 
of the center of the transit, which is altered both by bulk motion of the 
transit signal and asymmetry of the transit light curve. The method was 



proposed by Szabo et al. (2006) as being equivalent to that of barycentric 



transit timing, but in a later paper (Simon et al. 2007) the differences 



between these two methods were discussed. Again, while this method has 



not been used to place limits on moons of known planets, Szabo et al. (2006) 



used this updated method to investigate the number of moons expected to be 
detected by missions such as COROT and Kepler. They investigated this in 
terms of both giant and terrestrial planets, and suggested that Earth-Moon 
type systems could be detected. 

While photometric transit timing has been investigated using a Monte 



Carlo simulation (Szabo et al. , 2006 ) and from a more theoretical standpoint 



(Simon et al. 2007), it is not possible to derive the signal form using liter- 



ature results. In particular, the only literature result relating to the signal 
is given in Simon et al. (2007), which states that the maximum size of this 



perturbation is given by 



max (TTVp 



Vtr 



Rrr 



M„ 



(4.29) 



As we do not have a signal form, 11 this cannot be converted into a 
threshold similar to those given. In addition, there is no expression given in 



llr The form of the TTV P signal can be guessed using equation (15) and Fig. 2 of 
Simon et al. (20071. Assuming that the silhouette of the planet and moon shown in this 



correspond to a planet and moon not at maximum separation, equation (15) can be used 
to show that the TTV P signal should be approximately a sinusoidal function of time. 
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the literature for the error in r assuming any type of noise, uncorrelated or 
otherwise. 



Consequently, in Part III of this thesis, these gaps will be addressed. 
First, in chapter [8j the form of r as a function of time will be derived for 
the case of circular and coplanar planet and moon orbits. This analysis will 
then be extended to the case of eccentric planet orbits. Then, in chapter [9j 
the functional form and standard deviation of the probability distribution of 
the error in r caused by photometric noise will be derived and investigated 
for the of white noise and more realistic correlated photometric noise. These 



results will then be combined in chapter 10, with the aim of first, deriving 



a simplified equation for the detection threshold similar to equations (4.18), 



(4.25) and (4.28), so that the four methods can be compared on an equal 



footing, and second, generating realistic moon detection threshold maps for 
the case of photometric transit timing. 



4.3.3 Microlensing 

The possibility of detecting moons of planets detected by microlensing through 
their perturbations on the microlensing light curve has started to be explored 
in the literature. An initial investigation into whether or not moons of plan- 
ets detected by microlensing could be detected suggested that detection was 



unlikely due to the finite source effect (Han & Han, 2002). However, a more 



in depth investigation found that Earth-sized moons may be detectable if 
the distance from their host planet is similar to or greater than its Einstein 



radius (Han, 2008). 



4.3.4 Timing 

For the case of planets detected by timing, the possibility exists to detect the 
additional timing perturbation due to planet-moon binarity, given sufficient 
timing sensitivity of the host. Currently the only host stars to display the 
required timing sensitivity are the millisecond pulsar hosts. Unfortunately, 
pulsar planets are rare, with only four discovered to date. Prior to this 
thesis, no analysis has been conducted on time-of-arrival perturbation due 
to planet-moon pairs. However, many other second order signals, such as 



the effect of the 2:3 resonance in the PSR B1257+12 system (Wolszczan 



However, equation (15) was derived assuming that there is no transverse motion of the 
planet and moon, which is nearly true at maximum separation, but may not be true at 
other times, especially if the planet and moon are close. Consequently, it is not apparent 
how this effect will modify the sinusoidal signal. 
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1994), free precession (Link, 2003) and Shapiro delay (Ord et al. 2006) have 
been detected. Consequently, moon detection using this technique may be 
possible. This technique will be analysed in detail in Part [IT] of this thesis, 
with the aim of determining which moons (if any) of the four pulsar planets 
may be detectable. 



4.3.5 Detecting moons of imaged planets 

For planets which have been directly imaged we have a a dot, attributed to 
the "planet", which is physically separated from a majority of the light from 
the star. However, as we do not yet have the technical capability to spatially 
separate this dot into an image of a planet and a moon for the case of Solar 
System planet-moon analogs, this dot is really a combination of light from 
a planet, and light from any moons of that planet. While moons cannot 
be directly detected, a number of techniques have been proposed that use 
light from this "planetary" dot to search for companion moons. These pro- 



posed techniques include using photometry ( 


Moskovitz et al. 


& Schneider 


2007 


) astrometry (Cabrera & Schneider 


2007) 


formation ( 


Williams &; Knacke 


2004. 


Cabrera & Schneider 



the presence of moons. 

The photometric effect of a moon on the light curve from a directly 
detected planet has been investigated in two different ways. First, the effect 
of moon-like satellites on the infra-red light curve of Earth-like planets was 



investigated by Moskovitz et al. (2009). They found that, as a result of 



a degeneracy between the effect of a moon and the effect of an inclined 
planetary spin axis, a TPF-like mission could only detect large (Mars-sized) 
satellites of terrestrial planets. In addition to the effect of the bulk motion 
of the planet-moon system about the star on the light curve, Cabrera & 



Schneider ( 2007 ) have also investigated the effect on both optical and infra- 
red light curves of discrete mutual events such as the moon eclipsing or 
casting a shadow on the planet and visa versa. They suggest that such 
events could allow for the detection of lunar-sized moons of Earth-analogs 
by missions similar to TPF-C. 

In addition to using "planet" light curves to detect moons, moons may 
also be detectable from perturbations in the position of the photocenter 
of the dot attributed to the planet. This method was also investigated 



by Cabrera & Schneider (2007), and the possibility of detecting Earth- like 



moons of gas giant planets using this method was addressed in the science 



case for the ELT (Hook 2005). 



Finally, moons of directly detected planets may also be detected us- 
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ing spectra. This can involve measuring the Doppler shift of the planet's 



spectrum due to its motion about the planet-moon barycenter (Cabrera & 



Schneider, 2007) or by noticing that Earth-like moons are much brighter in 



the 1-4/iin CH4 hole, than host gas giants in the habitable zone (Williams 



k Knacke, 2004). 



As yet, none of these techniques have been used to place limits on moon 



sizes for any of the seven directly detected planets. However, (Kalas et al. 



2008 ) report that the optical emission from the planet Fomalhaut b is consis- 



tent with an extended circum-planetary disk, the size of the Galilean satellite 
system. This lends support to the idea that moon systems may exist about 
such planets, and may, in future, be detectable. 



4.4 Conclusion 

To provide a context for extra-solar moon detection, the techniques of extra- 
solar planet detection were discussed in terms of the physics behind the tech- 
nique, their effectiveness, and the types of planets discovered using them. 
This discussion was then extended to include moon detection, with particu- 
lar emphasis on moons that can be detected through the pulsar timing and 
transit techniques. From this position we can now begin to discuss the first 
of the two moon detection techniques that will be investigated in this thesis, 
pulsar timing. 
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Part II 



Detecting Moons of Pulsar 

Planets 
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Chapter 5 



Possibility of detecting 
moons of pulsar planets 
through time-of-arrival 
analysis 



Authored by: Karen M. Lewis, Penny D. Sackett 
and Rosemary A. Mardling 



This chapter is a reformatted and expanded version of the paper: 
K. M. Lewis, P. D. Sackett, R. A. Mardling, 2008, "Possibility of Detecting 
Moons of Pulsar Planets Through Time-of- Arrival Analysis" , Astrophysical 
Journal Letters, 685, L153-L156. 



5.1 Abstract 

The perturbation caused by planet-moon binarity on the time-of-arrival sig- 
nal of a pulsar with an orbiting planet is derived for the case in which the or- 
bits of the moon and the planet-moon barycenter are both circular and copla- 
nar. The signal consists of two sinusoids with frequency (2n m — 3n p ) and 
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(2n m — n p ), where n m and n p are the mean motions of the planet and moon 
around their barycenter, and the planet-moon system around the host, re- 
spectively. The amplitude of the signal is the fraction sin I p [9(M p M m )/16(M p + 
M m ) 2 ][r m /r p ] 5 of the system crossing time r p /c, where M p and M m are the 
the masses of the planet and moon, r m is their orbital separation, r p is 
the distance between the host pulsar and planet-moon barycenter, I p is the 
inclination of the orbital plane of the planet, and c is the speed of light. 
The analysis is applied to the case of PSR B1620-26 b, a pulsar planet, to 
constrain the orbital separation and mass of any possible moons. We find 
that a stable moon orbiting this pulsar planet could be detected, if its mass 
was >5% of its planet's mass, and if the planet-moon distance was ~2% of 
the planet-pulsar separation. 



5.2 Introduction to extra-solar moons 



In the past decade and a half, over three hundred extra-solar planets have 
been discovered. 1 With the data expected to be produced by satellites such 



as COROT (Auvergne et al. 2003) and Kepler (Basri, Borucki, & Kock 
2005), it will not only be possible to find smaller planets, but moons of 
those planets as well (Szabo et al. 2006). As a result, the detectability 



of extra-solar moons is starting to be explored in terms of their effect on 



planetary microlensing (Han & Han, 2002) and transit lightcurves (Sartoretti 



& Schneider 


1999; 


Szabo et al. 


2006 


Simon et al. 


2007) 



Upper limits have 

already been placed on the mass and radius of putative moons of the planets 



HD 209458 b (Brown et al 



HD 189733 b (Pont et al. 



2006) and 



2001] ), OGLE-TR-113 b ( |Gillon et al. 
2007X 

While the limitations of microlensing and the transit technique for de- 
tecting moons have been discussed and used in the literature, the limitations 
of other techniques such as the time-of- arrival (TO A) technique have not. 
This technique involves determining the variations in line-of-sight position 
to the host star, usually a pulsar, using the observed time of periodic events 
associated with that host. The aim of this analysis is to explore what the 
TOA signal of a planet-moon pair is, and relate it to the planetary systems 
that can give the most precise timing information, those around millisecond 
pulsars. 



1 See, for example, http://exoplanet.eu/catalogue.php 
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5.3 Review of planetary detection around millisec- 
ond pulsars 



The first planetary system outside the Solar System was detected around the 



millisecond pulsar PSR 1257+12 (Wolszczan Sz Frail, 1992). This detection 



was made by investigating periodic variations in the time of arrival of its 
radio pulses using a timing model. An example timing model for the case 
in which the planet's orbit around the pulsar is circular is 

{t N ~ to) = (T N - T ) + AT C + AT R + R e .n/c + AT S + AT„ e 

+ TOA pert , p {M s , M p , r p , I p , f p {0)), (5.1) 



(for example, Backer 1993) where to and ijv are the times the initial and 
N th pulses are emitted in the pulsar's frame, To and Tjv are the times the 
initial and N th pulses are received in the observatory's frame, and where 
the terms ATc, ATr, R e .n/c, AT5 and AT„ e act to change the frame of 
reference from the observatory on Earth to the barycenter of the pulsar 
system. The terms ATc and ATr are clock correction terms. ATc converts 
the time recorded by the observatory atomic clock to terrestrial proper time 
while ATr contains time dilation corrections due to the transverse doppler 
effect and gravitational red-shift due to the Earth's motion through the 
gravitational potential of the Solar System. The term R e .n/c, corrects for 
the annual motion of the earth. AT5 and AT„ e correct for propagation 
effects, namely, variations in the amount of ray bending due to gravitational 
fields both inside and outside our solar system, and varying electron density 
along the line of sight to the pulsar respectively. The final term represents 
the effect of a planet on the motion of the pulsar, where r p is the planet- 
pulsar distance, I p is the angle between the normal of the planet-pulsar orbit 
and the line-of-sight, M s and M p are the mass of the pulsar and the planet 
respectively, and f p (0) is the initial angular position of the planet measured 
from the x-axis, about the system barycenter. 

Currently, four planets around two millisecond pulsars have been dis- 



covered, three around PSR 1257+12 (Wolszczan & Frail 



1994) and one around PSR B1620-26 (Backer, Foster, k Sallmen, 1993). 



1992 



Wolszczan 



These four planets include one with mass 0.02 Earth masses, the lowest 
mass extra-solar planet known. This high timing precision of millisecond 
pulsars indicates that they are optimal targets for planet, and consequent 
moon searches. 
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Figure 5.1: Coordinate system used in the analysis of the TO A perturbation 
caused by a moon. The left diagram shows the quantities used to describe 
the position of the three bodies in their mutual orbital plane, while the right 
diagram shows the relationship between this orbital plane and the observer. 



5.4 What is the TOA perturbation caused by a 
moon? 

In order to investigate the perturbation caused by planet-moon binarity, 



the timing model presented in equation (5.1) must be updated to include 
effects due to the presence of the moon. For simplicity, we consider here 
only systems in which both the orbit of the planet and moon around their 
common barycenter, and the orbit of the planet-moon barycenter around 
the pulsar, are circular and lie in the same plane. The resulting updated 
model is 

(t N ~ to) = (Tjv - T ) + AT C + AT R + R e .n/c + AT S + AT„ e 
+ TOA perttP (M s , M p , r p , I p , f p (0)) 
+ TOA pert)Pm {M s , M p , M m , r p , r m , I p , / p (0), / m (0)). (5.2) 

We have explicitly modified TOA per t )P to indicate that it depends on the 
combined planet-moon mass, and included another term, TOA pert:Pm , to 
account for planet-moon binarity. Here M m is the mass of the moon, r m 
is the distance between the planet and the moon, and / m (0) is the initial 
angular position of the moon measured from the x-axis, about the planet- 



moon barycenter (see figure 5.1). TOA perttPm can be derived from R s , the 
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vector between the system barycenter and the pulsar, using 



1 



t rt' 



R s • ndt"dt' = TO A 



pert,p 



+ TOA 



pert,pmi 



(5.3) 



c jo Jo 

where c is the speed of light and n is a unit vector pointing along the line 



of sight. From figure 5.1 we have that 



n 



sin I p i + cos I p k, 



(5.4) 



where the vectors i, j and k are defined in figure 5.1 



The governing equation for R s can be written as the sum of the zeroth 
order term, which describes TOA pertjP and the tidal terms, which describe 
TO ' Ap er tp m , 



d 2 R s _ G(M P + Mr, 
dt 2 r£ 



-r„ + 



M m + M p + M & 
M m + M v 



(5.5) 



where the second term is the tidal perturbation to the orbit due to the 
presence of the moon, which has been written in terms of the disturbing 



function 1Z (for example Murray & Dermott, 1999, p. 226), where r p 
Xi + Yj + Zk, 



(5.6) 



and where 



n 



M m + M p 
' M m + M p + M s 



G {M m + M p ) 



GM P 

M m +M p lm 

GM m 



r v + 



(5.7) 



The disturbing function can be expanded using multipole analysis (for ex- 



ample Jackson 1975 p. 92) in terms of Legendre polynomials. Assuming 



r m < r p , the expansion can be truncated to order r^/r^ giving 



n 



M m + M p GM m M p r*l _ _ 

M m + M p + M s M m + M p 7-3 2 Um JP> ' 



(5.8) 



As the orbits are both circular and coplanar, r m and r p are both constant, 
f p (t) = n p t + f p (0), and f m (t) = n m t + / m (0) where n p and n m are the 
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mean motions of the two respective orbits and are both constants, and the 
inclination, I p , describes the planes of both orbits. Using polar coordinates 
in the plane of the orbit to evaluate V rp 1Z we have, 



d 2 K s 

dt 2 



n 



sin 



G(M p + M m ) , 
Ip 5 r v cos jp + sin Ip 



GM p M m r 2 m 



6 /3 9 

- sin(2(/ p - f m )) sin f p + ( - + - 



(M m + M p ) r4 
cos(2(/ p - f m )) ) cos/p 



(5.9) 



So that from equation (5.3), 



TOApert^pm 

3 



[n. 



s\nIpGMpM m r A m 
c(M m + M p ) r4 

2 cos(/ p - 2/ m ) + - 



4n2 



cos/p 



15 



(3n p - 2n m ) 2 



cos(3/p - 2/ m ) 



(5.10) 



The cos f p term in equation (5.10) has the same frequency as the signal of a 
lone planet and it acts to increase the measured value of M p + M m derived 
from TOA pert ,p by (3/4) (r 2 m /rl){MpM m / (M p + M m )). Consequently, this 
term can be neglected as it will be undetectable as a separate signal. Also, 
the stability region for a prograde satellite of the low-mass component of 
a high-mass ratio binary extends from Vr, the Roche radius, to 0.36r# for 
the case of circular orbits, where rn = f p [(M P )/(3M S )] 1 ^ 3 is the secondary's 



1999). As moon detectability increases as 



Hill radius (Holman & Wiegert 

r^, and is equal to only a few planetary radii, this limit can be safely 
ignored. When r m is equal to 0.36r#, n m « 8n p . As n p <C n m is likely, 
we have that the denominators of the cos(/ p — 2/ m ) and cos(3/p — 2f m ) 
terms will never approach zero. This, in addition to the assumption of zero 
eccentricities, means that resonance effects can be neglected. Consequently, 
equation (5.10) can be simplified by neglecting n p in the denominators, 
giving 



TOA 



sin I„GM n M, 



pert,pm 



m i m 



c(M m + Mp 



32n^ 



cos(/ p - 2f m ) 



15 



32nH. 



cos(3/p - 2f n 



(5.11) 
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Writing n p in terms of r m , using Kepler's law, gives 



TOA 



pert,pm 



. T M p M m 
sin In 



p {M m + M p f c 



32 



cos(/ p - 2/„ 



15 

+ ^cos(3/ p - 2/„ 



, (5.12) 



where we are making no assumptions about the size of M m /M p . 



A similar study was conducted by Schneider Sz Cabrera (2006), invest! 



gating the radial velocity perturbation due to an equal-mass pair of binary 
stars on a distant companion. Converting their radial velocity perturbation 



to a timing perturbation, setting M p 
equivalent to r m /2, our results agree. 



M m , and noting that their a a is 



5.5 Is it possible to detect moons of planets orbit- 
ing millisecond pulsars? 

To investigate whether or not it is possible to detect moons of pulsar planets, 



we simplify equation (5.12) by summing the amplitudes of the sinusoids, 



giving the maximum possible amplitude 



max {TOA. per f tPm ) 



9 sin I p M m M p r p 
16 (M m + M p ) 2 7 



(5.13) 



Thus, the size of the perturbation varies as [M m M p / (M m + M p ) 2 ][r m /r p ] 5 
times the system crossing time, r p /c. So, the best hope of a detectable 
signal occurs when the planet-moon pair widely are separated from each 
other, both quite massive, and very accurate timing data is available. For 
example, a stable system such as a 0.1 AU Jupiter- Jupiter binary located 
5.2AU from a host pulsar would produce a TOA pert:Pm of amplitude 960ns, 
which compares well with the 130ns residuals obtained from one of the most 



stable millisecond pulsars, PSR J0437-4715 (van Straten et al. 2001). 

To demonstrate this method, the expected maximum signals from a 
moon orbiting each of the four known pulsar planets were explored. It 
was found that in the case of PSR B1620-26 b, signals that are in principle 
detectable could confirm or rule out certain configurations of moon mass 



and orbital parameters (see figure 5.2). 



In the particular case of PSR B1620-26 b, the perturbation signal will 



not match the signal shown in equation (5.12) due to the effect of its white 
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Moon Mass (Earth Masses) 



Figure 5.2: The regions of parameter space containing detectable (shaded) 
and stable (cross-hatched) moons of the planet PSR B1620-26 b are shown 
as a function of planet-moon separation and moon mass. The total mass and 
the distance of the planet-moon pair from the parent pulsar were assumed to 



be 2.5 Jupiter masses and 23AU respectively (Siggurdsson et al. , 2003]), while 



it was assumed that sin/ p = 1. The mass of the host was set at 1.7 solar 
masses (the sum of the mass of the pulsar and its white dwarf companion) . 
The 3<T detection threshold was calculated assuming the ~ 40/is timing 



residuals given in Thorsett et al. (1999) are uncorrelated and that similar 



accuracy TOA measurements of PSR B1620-26 continue to the present day. 
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dwarf companion. As a side project, this companion's effect was investi- 
gated and found to be the introduction of additional perturbations on top 
of the TOAp er t.p and TOA per t, pm calculated. Consequently, the detection 
threshold represents an upper limit to the minimum detectable signal and 
the analysis is still valid. 

Unfortunately, there are practical limits to the applicability of this method. 
They include discounting other systems that could produce similar signals, 
sensitivity limits due to intrinsic pulsar timing noise, and limits imposed by 
moon formation and stability. 

First, other systems that could produce similar signals need to be inves- 



tigated. Possible processes include pulsar precession (e.g., Akgiin, Link, h 



Wasserman, 2006), periodic variation in the ISM (Scherer et al. 1997), grav- 



itational waves (Detweiler, 1979), unmodelled interactions between planets 
(Laughlin & Chambers, 2001) and other small planets. To help investi- 
gate the last two options, we plan on completing a more in-depth analysis 
of the perturbation signal of an extra-solar moon, including the effects of 
inclination and eccentricity. 2 

Second, the noise floor of the system needs to be examined. The suit- 
ability of pulsars for signal detection is limited by two main noise sources, 



phase jitter and red timing noise (e.g., Cordes, 1993). Phase jitter is error 
due to pulse-to-pulse variations and leads to statistically independent errors 
for each TOA measurement. Phase jitter decreases with increasing rotation 
rate (decreasing P) due to the increase in the number of pulses sampled each 
integration. Red timing noise refers to noise for which neigbouring TOA 
residuals are correlated. Red timing noise has been historically modeled as 



a random walk in phase, frequency or frequency derivative (e.g., Boynton 



et al. 


1972; 


Cordes 


1980 


Kopeikin 


1997) 



Red noise is strongly depen- 
dent on P. It has been proposed that red noise is due to non-homogeneous 
angular momentum transport either between components within the pulsar 



(e.g. Jones 1990) or between it and its environment (e.g. Cheng, 1987). To 
illustrate the effect of these two noise sources, an estimat e of the resulting 
TOA residuals as a function of P and P is shown in figure 5.3 For compar- 



ison, the values of P and P of every pulsar listed in the The ATNF Pulsar 
Catalogue 3 (Manchester et al. 2005) are also included. 



Third, whether or not moons will be discovered depends on whether or 
not they exist in certain configurations, which depends on their formation 
history and orbital stability. Recent research suggests that there are physical 



2 See chapter [fi] 

3 http: / / www.atnf.csiro.au/research/pulsar/psrcat/. 
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Figure 5.3: Contour plot of predicted timing noise as a function of pulsar 
rotation period and period derivative. This plot is based on figure 9 from 



Cordes ( 1993 ) . The functions and assumptions used to generate the contours 



are the same as given in Cordes ( 1993 ) , noting that the TOA integrations are 
1000 seconds long. Note that correlated timing noise measured for individual 
pulsars can vary from the predicted values by two orders of magnitude (e.g., 



Arzoumanian et al. 1994). 
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mass limits for satellites of both gas giants (Canup & Ward, 2006) and 



terrestrial planets (Wada et al. , 2006). Also, tidal and three-body effects can 



strongly affect the longevity of moons (Barnes & O'Brien, 2002; Domingos, 



Winter, & Yokohama, 2006 Atobe & Ida, 2007). 



Finally, while this method was investigated for the specific case of a 
pulsar host, this technique could also be applied to planets orbiting other 
clock-like hosts such as pulsating giant stars (Silvotti et al. , 2007) and white 
dwarfs (IMullally, Winget, & Kepler I [20061) . 
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Chapter 6 

Effect of mutual inclination 
and eccentricity on the 
time-of-arrival perturbation 



6.1 Introduction 



As mentioned in Lewis et al. (2008), in addition to considering the case 



where the planet and moon's orbits were circular and coplanar, it would be 
of use to determine the effect of mutual inclination and eccentricity in the 
orbit of the planet and moon, on the time-of-arrival perturbation due to 
planet-moon binarity. As the method used in chapter [5] to investigate the 
time-of-arrival perturbation for the case of circular coplanar orbits cannot 
be easily extended to deal with these cases, a more general method will 
be used. To begin, a set of expansions developed by my PhD supervisor, 
Dr. Rosemary Mardling, will be introduced, which allow the disturbing 
function to be written in terms of the semi-major axis, eccentricity, inclina- 
tion, longitude of the ascending node, argument of periastron and the mean 
anomaly corresponding to the planet and moon orbits. Then, the selection 
of reference plane and direction required for the definition of Euler angles is 
motivated and discussed. Using this coordinate system, the equations defin- 
ing this perturbation are then reformatted such that the expansions can be 
easily applied on a case by case basis. First, the case of circular coplanar 
orbits will be re-investigated, to demonstrate the use of the method and to 
show that the expressions produced in chapter [5] and using this method are 
equivalent. Then, building on this foundation, the effect of mutual inclina- 
tion, low eccentricity in the moon's orbit and low eccentricity in the planet's 
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orbit on the time of arrival perturbation due to planet-moon binarity will 
be investigated in turn. We begin by introducing the expansions that will 
be used to write the disturbing function in terms of the orbital elements of 
the planet and moon's orbits. 



6.2 Writing the disturbing function in terms of the 
orbital elements of the planet and moon 



As can be seen from equations (5.3) and (15 . 51) , the form of the time-of- 



arrival perturbation due to planet-moon binarity is entirely specified by 1Z, 
the disturbing function, and modified by n, the unit vector directed along 
the line-of-sight. Consequently, the ability to write the disturbing function 
in terms of time and the orbital elements of the planet's and moon's orbits, 
corresponds to the ability to determine the time-of-arrival perturbation due 
to planet-moon binarity for any orbital configuration as a function of time. 
For this work I will be using a method pioneered by my PhD supervisor, 
Dr. Rosemary Mardling, which allows the disturbing function to be written 
in terms of these orbital elements and is valid for all values of eccentricity 
and inclination. While aspects of this method are presented in the liter- 
ature (Mardling 2008b|a ), for completeness the fundamental mathematics 



required for this chapter will be summarised. 

To begin, recall that, for this work, the disturbing function is given by 

G(M p + M m )M s GM m M s GM p M s 

n = h \ m r + i m r v 6 - 1 ) 

P \ T P- M p +M m r m I I T P + M p +M m r m I 

While this function can be expanded in terms of Legendre polynomials as 
was done in chapter [5j it can also be expanded in terms of spherical har- 
monics. To do this we use the identity 

1 00 ^ A I 

ib^r = £ E ^^YUOaMYSnfrM, (6-2) 

1 1 1=0 m=-l 

where |a| < |b|, a = |a| and b = |b|, and where 6 a and tp a , and 6b and ipb 
represent the orientation of a and b expressed in spherical polar coordinates. 
In addition, we have that 



yuw) = V ^fr^l pr(cosg)e ^' (6 - 3) 
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z 




where Pj 71 (cos 6) is the associated Legendre polynomial of degree I and order 
m given by 

too = - * 2 r /2 ^(* 2 - 1)', (6.4) 

where 

P r m (x) = (-l) m !^^T(z)- (6-5) 
Using this expansion on 1Z gives 

GM m M p M s ^ ^ 4tt ^ 

^ = M J- M 1^ 1^ 7JT—T M l^l Y lm{0 m ,^m)Y lm {e p ,i;p), (6.6) 

where 

_ a4--(-m ; )'-' 
(M m + Ay'- 1 

and where m , ip m , 9 P and Vv describe the angular orientation of r m and 
v p in spherical polar coordinates (see figure 6.1), and where the monopole 
(I = 0) and dipole (I = 1) terms are exactly zero. 



S3 



Writing the disturbing function in this way, it can be seen that it consists 
of a sum of terms, each comprised of five factors, which are, a constant, the 
distance between the planet and moon raised to the power I, the distance 
between the star and the planet-moon barycenter raised to the power —(1 + 
1), a spherical harmonic depending on the orientation of r m , and a spherical 
harmonic depending on the orientation of r p . Consequently, we need a way 
to express each of these factors in terms of time and the orbital parameters 
of the system. We begin with the spherical harmonics. 

6.2.1 Writing Yi m (6,ip) in terms of /, cu, £1 and / 

Following Mardling (private communication), Yi m (0,if)) can be written in 
terms of the orbital elements /, u), £1 and /, such that, 

I 

Y lm (9,^)= Yl D lmmf (I,u,n)e im ' f , (6.7) 

m'=-l,2 

where I, cj and are the Euler angles specifying the orientation of the orbit 
and where Di mm i(I,ui,Q) are Wigner D-functions, such that 

D lmm , (J,w,n) = (-if +m+m 'Y lm (|,o) 7 W (I)e^ m ' Mn \ (6.8) 

where, for completeness, the inclination functions 7j mm '(I) are tabulated 
in Appendix [C] Applying this expansion to both spherical harmonics in 
equation ( |6.6[ ) and rearranging gives 

_ GM m M p M s ^ ' ' 4tt < r /vr 

1=2 m=-l m'=-L2 m"=-l,2 



M m + Mr, ^ ^ ^ ^ 21 + 1 a l n +1 V V 2 ' 

m P l=2m=-Im'=-l,2m"=-l,2 "P 

v (— ^\2m+m'+m" (j \ t j \ i(m' 'u m +m!! m -ra" Wp-mSlp) 

A V Umm! y-m) Ilmm" \ 1 p) e 



' m im'fn 
I 

a m 



(6.9) 



As can be seen, using this transformation functionally replaces each term 
with a set of new terms each consisting of three factors, which are a constant 
which depends on a m , I m , u m , &>m, %>, Ip, w p and U p , a term which depends 
on r m and f m and a term which depends on r p and f p . These last two terms 
can also be described in terms of ctm > £ m , M m (t), a p , e p and M p (t) using a 
different expansion, where we note that the time dependance differentiates 
between the mean anomalies M m (t) and M p (t), and the masses M m and 
M p . 
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6.2.2 Writing terms of the form {r/a) l e im f in terms of a, e 

and M(t) 

To begin, we take a closer look at the quantities in square brackets in equa- 
tion (6.9). While these terms are not simple functions of time, they are 
approximately periodic, which means that they can be described as Fourier 
series, that is 



\ 1 °° 

j e imf m = 4 im )(e m )e inMm( * ) (6.10) 



m 



n=—oo 



and 

\ l+l oo 
— J e ~ im h= V F« m \e p )e inM "W (6.11) 
r p J 

H ' n=—oo 



where the coefficients Sn m ^(e m ) and Fn lrn \e p ) are functions of the eccentric- 
ity and are defined by 



I f 2lT r l 



2tt 

and 



V m \e m ) = — / -^e imfm e- inMm ^dM m (t) (6.12) 

a m 



1 f^a l+1 



F% m Xe v ) = — I -^e-^e^^dMpit) (6.13) 

-i T"p 



The properties of these coefficients were investigated in Mardling (2008b) 
and are summarised in appendix [Dj Also, the dependence of the coefficients 
si m \e m ) and Fn\e p ) on e m and e p to third order is presented in tables 
ITIT1 and 11X21 



6.3 Selection of the reference plane and direction 

In order to derive expressions for the time-of-arrival perturbation due to 
planet-moon binarity, a reference plane and direction required for the defi- 
nition of the Euler angles /, u and needs to be selected. 

To simplify the expression for 1Z we choose the coordinate system such 
that the x-y plane coincides with the planet's orbit. For this choice we do not 
need to expand the spherical harmonic corresponding to the planet's orbit as 
Op = tt/2 and ip p is equal to the sum of f p , uj p , O p and an additive constant 
which depends on the reference direction. Unfortunately, an unwanted side 
effect of this choice is that the coordinate system is no longer inertial, and 
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(a) Coordinate system for planet's orbit. (b) Coordinate system for moon's orbit. 

Figure 6.2: Diagram showing the definition of the Euler angles I p , O p , uj p , 
Imi and co m used in this chapter, where we define w p = uj p + Q p . In 
particular note that n m is the orbit normal to the moon's orbit while n is 
directed along the line of sight 



any process which acts to alter the orientation of the planet's orbit acts 
to alter the orientation of the coordinate system. This issue is beyond the 
scope of this thesis, but will be briefly discussed in chapter [TT] in context of 
directions for future research. We now move onto the selection of reference 
direction. 

For this work, the x-axis was selected as the reference direction, such 
that, for our choice of reference plane i\) p = f p + u p + Q p . In addition, we 
choose the x-axis to correspond with the projection of the line between the 
observer and the system barycenter onto the plane of the planet's orbit. 
This decision was made to reduce the number of non-zero components in n. 
For reference, these definitions are summarised in figure [6\2| 

These two decisions have a number of ramifications, especially for the 
definition of the inclinations I m and I p . These will be briefly highlighted. 
First, as the reference plane is the planet's orbit, the Euler angles for the 
moon's orbit are measured relative to the planet's orbit (see figure 6.2(b)). 
In particular I m represents the mutual inclination between the orbit of the 
planet and that of the moon, such that I m = implies that the orbits are 
coplanar and I m 7^ implies that they are not. Second, for the planet's 
orbit, the situation is a little more involved. As the orbital plane of the 
planet is the reference plane, using the standard definition of inclination, 
the planet's orbit would have zero inclination by definition. Consequently, 
following Lewis et al. (2008), for this chapter we define I p to be the angle 
between n, the vector along the line of sight and the normal to the planet's 
orbital plane (see figure 6.2(a)). Now that a coordinate system has been 
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selected and discussed, we can use the methods introduced in section 6/2 
to produce a more useful form for the expression for the time-of-arrival 
perturbation. 



6.4 Derivation of the general equation 

We begin by deriving a general expression for the time-of-arrival perturba- 
tion, to which the expansions described in section 6.2 can be applied where 
appropriate. To start consider equation (5.3), 

*t rt 1 



o J o 



R s • ndt'dt' = TOA pert)P + TOA pert 



where we recall that TOA per t tP is the time-of-arrival perturbation due to 
orbit of the the planet-moon system about the pulsar, TOA per t tPm is the 
timing perturbation due to planet-moon binarity, and R s is the vector from 
the system barycenter to the star. From chapter [2j we have that r p is 
the vector from the planet-moon barycenter to the star, and thus R s = 
\{M p + M m ) I (M p + M m + M s )]r p . Using this expression and equation (2.15) 
we then have that 



TOA 



pert,pm 



c M s J J 



dU 



• ndt'dt, 



(6.14) 



where we note that a different definition of 1Z is used in Lewis et al. ( 2008 ) 
than used in this chapter. Expanding the disturbing function using equa- 
tion (6.6) gives 



TOA 



pert,pm 



1 1 

cMl 



Jo 



d_ 
dr T1 



' GM m M p M s ^ ^ 



M m + M, 



p 



1=2 m=-l 

x-^iYi m (e m ,ipm)Y* m (9 m ,ip m ) -ndt'dt, 



4tt 
21 + 1 



Mi 



(6.15) 



1 GM m M p 



cM m + M v 



t rt' 



p JO Jo 



21 + i r ; 

2 m=-l 'P 



{l + l) Yl * m {e p ^ p )e rp + 



dY 2 * m (6 p ,^ p ) 



'V- 2 



-imY^ m (6 p ,ip p )e^ p 



■ ndt'dt, 



(6.16) 
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where we note that the expression 9 p = tt/2 has been used. Using the fact 



that r m /r p <C 1, we retain the I = 2 terms only, 1 equation (6.16) simplifies 
to 



TOA 



pert,pm 



ft 

cM m + M p J J 



1 GM m M p 



2m\ l/ pi' l l J p) e rp + 



\m=-2 P 
dY* m {6 p ^p) 



'P— 2 



ndt'dt. (6.17) 



where we note that M2 = 1. We now consider the definition of n. As 
the direction of the unit vectors in spherical polar coordinates depends on 
position, it would be useful to convert n into Cartesian coordinates. Using 
the fact that 8 p = ir/2 we have that 



>x p = cosip p e rp - smip p e^ p , 
i Vp = sin^pe^ + cosi> p e^ p , 



(6.18) 
(6.19) 
(6.20) 

We now write the three components of n, n Xp , n Vp and n Zp , in terms of the 
quantities of the system. Comparing with figure 6.2(a) and noting that n 
lies in the x-z plane gives 



n 



Vp 



sin I p , 
0, 

cos L 



V 



(6.21) 
(6.22) 
(6.23) 



where I p is the angle between the line-of-sight and the vector normal to the 
planet's orbit. 

So, using this notation we have that 

n = [cos ip p e rp - sin ip p e^ p ] sin I p + + [-e 6p ] cos I p , (6.24) 



x For the case of the four known pulsar planets PSR 1257+12 b, PSR 1257+12 c, 
PSR 1257+12 d and PSR B1620-26 b, the highest values of r m /r p allowed by orbital 
stability are 0.0012, 0.0034, 0.0033, and 0.039, where we note that the ratio for PSR B1620- 



26 b was calculated under the unrealistic ( Ford et al. 2000 Siggurdsson & Thorsett 2005 1 



assumption that its orbit is not eccentric. Consequently, for these four cases the / = 3 
terms will be at least 0.0012, 0.0034, 0.0033, and 0.039 times smaller than the I = 2 terms 
respectively, and can thus be neglected. 
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which is equivalent to 



n = sin I p cos ip p e rp - cos I p e dp - sin I p sin tp p e^ p 



(6.25) 



The two equations we will require are equations (6.17) and (6.25). These 
equations will form the basis for the following analyses into the perturbation 
in the case of circular coplanar, inclined and eccentric orbits. 



6.5 Circular coplanar orbits 

To begin the investigation into the form of the time-of-arrival perturbation 
due to a moon, we revisit the case of circular and coplanar planet and moon 
orbits. First, we must write the spherical polar angles representing the 
position of the planet (8 P and ip p ) and the moon (6 m and ifi m ) in terms of 
the angles f p , 



p and Q p and /„ 



and J7 m . As mentioned in section 6.3 



the orbit of the planet is defined to lie in the horizontal plane defined by 
Op = it/ 2. As the orbit of the moon is coplanar with that of the planet, we 



also have that m = ir/2. Finally, recalling that the x-axis is the reference 
direction for both the spherical polar coordinate system and the definition 
of longitudes, we have that ip p = f p + uj p + Q p and ijj m = f m + uj m + U m . 
Substituting these values into equation (6.17) and noting that i^ m (0) = 

for m + 1 even, this simplifies 



for m + l odd and dP™ (cos 9 p )/d6, 
to 



pi cos e v =o 



TOA 



pert,pm 



1 GM m M p 



t rt' 



E 



(2 — m) \ r: 



l r 2 

m 



< -H„ + A/„ ./,, ./,, \ ^-i„ (2 + m)! rj 

(P 2 m (0)) 2 (iMfrn+TZm-U-TZp) [-^ - iljl^ p ] ) ■ ndt'dt, (6.26) 



where the expressions w p = uj v + Q p and zu m = uj m + Q m have been used. 
Expanding the sum and rewriting the complex exponentials in terms of 
sinusoids then gives 



TOA 



pert,pm 



1 GM m M p /•* /•*' fr 2 m 



c M m + M p Jo Jo 

-9 3 
— cos(2/ m + w m - f p - Wp) - - 



+ - sin(2/ m + w m - f p - w p )e^ p 



ndt'dt. (6.27) 
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Writing n out in full using equation (6.25) and combining like terms gives 



! GM m M pS mI p [* [* rj ( 3 
TOA pert , pm = -- Mm + Mp J q J Q ^{- 4 - (/„ + * p ) 

15 

— - - cos(2/ m + 2w m - 3/ p - 3w p ) 



cos(2/ m + 2uj m — f p — Wp) \ dt'dt. (6.28) 



As both the planet and moon orbits are circular, r m = a m and r p = a p , and 
thus these terms can be moved outside the integral as they are constant. 
Also, we have that, = n p and = n m , where n p and n m are the mean 
motions of the planet and moon respectively, are also constant. Taking 
advantage of these simplifications and performing the two integrals gives 



1 GM m M p sin I p / 3 



2 

TOA pert , pm = ---^-P-^^ ^ C os (f p + w p ) 



+ 8(2n m 1 -3n p )2 C ° s(2/ ™ + 2Wm ~ ^ ~ ^ 



3 

8(2n m - n p ) 2 



+ ^7^Z cos ( 2 fm + 2ro m - f p - w p ) ) . (6.29) 



Noting that f p + w p and f m + w m in this work is equivalent to f p and /, 
in chapter [5| this is exactly the same expression as given in equation (5.10). 



Also, as pointed out in chapter [5| the first term of equation (6.29) cannot 



be distinguished from TOA per t t p, the the time-of-arrival signal due to the 
orbit of the the planet-moon system about the pulsar, as they have the same 
angular frequency, and consequently it can be neglected. 

For reference, a realisation of TOAp er t^ prn calculated for an example 



planet-moon pair corresponding to PSR B1620-26 b is presented in figure 6.3 
While data corresponding to a full orbit of PSR B 1620-26 b is not available 
as the pulsar has only been observed for a little over 20 years, and the orbital 
period is of the order of a century, the orbital elements can be constrained 
by measuring the period derivatives and measuring perturbations on the 



orbit of the white dwarf companion (e.g. Joshi Sz Rasio[ 1997). Conversely, 
the perturbation due to planet-moon binarity causes timing variations over 
much shorter timescales, and recalling from chapter [5] that the timing errors 
are of the order of 40/is, this perturbation is potentially detectable. 



In particular, as can be seen in figure 6.3 for the case of circular and 



coplanar planet and moon orbits, TOAp er t y pm looks like a high frequency 
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Figure 6.3: Functional form and frequency composition of the time-of-arrival 
perturbation due to planet-moon binarity for the case where the planet 
and moon's orbits are circular and coplanar. This curve was calculated for 
the case of a PSR B1620-26 b analog, in particular, it was assumed that 
M p = 2.3Mj, a p = 23AU, M m = 0.2Mj and a m = 0.8AU. 



sinusoid which has been multiplied by an envelope function. While this is not 
strictly mathematically the case, in that TOA per t tPm is given by the sum of a 
sinusoid and a beat function, this analogy is intuitively useful. Noting that 
the sinusoids with frequency fx = 2n m — n p and fi = 2n m — 3n p are causing 
the beating in TOA per t tPm , and recalling that the frequency of the envelope 
function for a beat is given by (/x — /2V2 = n p while the frequency of the 
high frequency sinusoid that it modifies is given by (/1 + /2V2 = n m — n p , 
it can be surmised that the envelope function is defined by the effect of the 
planet's orbit and the high frequency sinusoid defines the effect of the moon's 
orbit. We now move to the first of the more complex cases analysed in this 
chapter, the case where the orbits of the planet and moon are mutually 
inclined. 



91 



6.6 Mutually inclined orbits 



As discussed in chapter [3j moons are more likely to form or be captured into 
orbits which are roughly coplanar with that of their host planet. However, 
highly inclined moon systems do form, for example, the satellite system of 
Uranus. Consequently, it is of scientific interest to investigate the effect of 
mutual inclination in two regimes. First, the effect of slight orbital misalign- 
ment will be investigated by deriving an expression for the time-of-arrival 
perturbation correct to order sin/ p . Then the case of arbitrary orbital mis- 
alignment will be considered. However, before either of these cases can be 



investigated, equation (6.17) must be written in terms of the orbital ele- 
ments. 

As in the previous case, we begin by writing the angles 9 P , tp p , 9 m and ip m 
in terms of the orbital elements f p , u p , Q p , f m , uj m and $7 m . As for the case 
where the orbits were coplanar, we have that 6 p = tt/2 and ij) p = f p +uj p +U p , 
however, as the moon's orbit is now inclined, writing 9 m and ip m in terms 
of f m , ui m and Q m is a little more challenging. This leads to two issues that 
must be resolved before analytic progress can be made: 

1. The inclusion of terms of the form dYi m {9 p , ip p )/d9 p \ =7j y 2 in the sum. 

2. The time-of-arrival perturbation is no longer independent of 9 m . 

The solution to both of these issues is to expand the term in question, 
however, the way this is done is slightly different for each of the two cases. 

6.6.1 Recasting terms of the form 9Ylm ^p'^ 

To begin, we note that there are a number of recurrence identities which 
relate Legendre polynomials of different orders. One such identity is 

sm9 p ^^- = I cos OpPj 71 (cos 6 P ) _ (/ + m )P™ 1 ( CO s9 p ). (6.30) 

d9 p 

This identity is valid for both positive and negative m. Substituting in 
9 p = 7r/2, gives 

dPp 1 (cos P ). 



d9 p 



= -(i + m)J^ 1 (0). (6.31) 

?p=7T/2 



Comparing this with equation (6.3), it can be seen that 
9Yi m (e p ,tp p 



d9 p 



?p=7T/2 



'(2/ + l)(/-m) / , , (ir 
(21- 1) (I 



— Ill / 7T \ 

T ^(/ + m)y a _ 1)m (-,^j. (6.32) 



92 



Substituting equation (6.32) into equation (6.17) and noting that I = 2 gives 



1 GM m M p 

UA pert<pm - cMm + Mp 



t pi' 

o Jo 



A — j,1 

2 -^Y 2m {e m ^ m ) 



5 

\m=-2 P 



7T 



3>?m (nip S 



'5(2 - m) 
3(2 + m) 



;2 + m)y; m e 6p 



7T 



-imYr 



2 m 



ndi'di. (6.33) 



6.6.2 Describing a rotated moon orbit 



As the moon's orbit no longer lies in the same plane as the planet's orbit, 
its orbital plane is no longer given by m = tt/2. To deal with this in- 
clined orbit, the expansion described in section 6.2 will be employed. Using 



equations (6.7) and (6.8) to expand equation (6.33) gives 
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1 GM m M p 



cM m + M v 



t r t' 



p JO JO 



E E 



i-i) 



m+m 1 



, m=-2m'=-2,2 p 



(2-m')! 




>' pro' (Q) e *( m ' fm+m'wm-mfp-m-njp) e imU n 



(2 + m)! 



P2m(0)e ^ + v §rS (2 + m)Pim(0)e ^ 



ndi'cft. (6.34) 



where, for easy reference, a table of Ji m m'(lm) functions is given in ap- 
pendix [C] Armed with equation (6.34), we are now in a position to calculate 
the form of the perturbation for the cases of small and arbitrary amounts 
of misalignment. 



6.6.3 Solution in the case of circular orbits and small mutual 
inclination 

For the case where sin/ m , the sine of the relative inclination is near zero, 
we have that 

cos/ m « 1 - - sin 2 I m . (6.35) 
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Figure 6.4: Schematic diagram of the orbital orientations of the moon and 
planet orbits at four stages of the "year" . 



As a result of the prediction that low inclinations should be common, it 
would be interesting to only include the terms of order sin I m and investigate 
the signal form. As "fimm'ilm) are the terms which contain I m , a quick 
investigation of the table in Appendix |C| shows that only ^lm' (Im) an d 
72—im'(Im) have terms that are first order in sin/ m . Also, to first order in 
sin/ m , all other terms are exactly equal to the values found in the circular 
coplanar case, which allows us to write 



TO A -TOA 1 [* f GM m M p ' * r_l 

UA pert , pm - OA pert , cc - y Q y Q ^ + Mp ^ 

. 1(2 — m')\(2 — m)\ , , , 

x Tw lm )\ 9 , , , L , , 2 + m P 2 m Pr 
y (2 + m')][Z + my. 

x (_j)">+m' e i(m'(/ ra+Uro )-m(/ J+E 7,)) e imn me ^ . ^dt'dt. (6.36) 



where TOA per t tCC is the perturbation TOA per t )Prn for the case of circular 
coplanar orbits. Expanding the sum, substituting in the values of r )2mm l (Jm) 
to order sin/ m from Appendix [Cl the expression for n from equation (6.25) 
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and combining the complex exponentials into sinusoids gives 



1 /"* /"*' GM m M p cos I p 



TOAp er ip m — TO Ap er i cc it 1 1 i > 

c JO Jo M m + -'Wp ' p 

x ^| sin 7 m sin(2/ m + 2u m - f p - m p + fi m ) 
3 

+ - sin I m sin(2/ m + 2w m + f p + w p - O m ) 

-~ sin/ m sin(/ p + w v - VL m )^ dt"dt' . (6.37) 

Noting that as the orbits are circular, r p = a p , r m = a m , ^ = n p and 
^jr = n m , the integration can be performed to give 

1 GM m M n sin I m cos /„ al 



TO A — TD A - Z " — m±v± V 0111 ±m UUJ A P u m 

J- <^s±pert,pm — -L ^J-t\pertfic , , , , 4 

C -i"m ~r lrlp Up 

3 



sin(2/ m + 2oj m - f p - Wp + Q, r 



4(2n m - n p ) 2 
3 

- J7^ ^ sin ( 2 /™ + 2uJ ™ + fp + ^p- ^m) 

<±(2n m + Tip j 

sin(/ p + w p - £2 m ) J . (6.38) 
2n^ y 

As can be seen above, low inclination results in the inclusion of one 2 addi- 
tional frequency in the solution. It is interesting to note that this additional 
signal looks like a beat function. 

To see why, consider how the position of the planet and moon relative 
to the pulsar changes over a moon orbit as a function of the position of the 



planet-moon pair about the planet's orbit (see figure 6.4). During the two 



sections of the planet's year when the moon's orbital plane is aligned with the 



vector pointing to the pulsar (stage 1 and 3 in figure 6.4), the moon moves 
from being a p + (M p /(M m + M p ))a m away from the pulsar to being a p — 
(M p / (M m +M p ))a m away from the pulsar, while the planet moves from being 
Op + (M m / (M m + Mp))a m away from the pulsar to being a p — (M m / (M m + 
M p ))a m away from the pulsar during a single moon orbit. As these are 
the same values as for circular coplanar orbits it should be unsurprising 



2 Terms with frequency 2n m — n p and n p are already present in the expression corre- 
sponding to the circular coplanar case. Consequently, the term with frequency 2n m + n v 
corresponds to the only "new" frequency. 
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that these two times correspond to the zero of the envelope function of 
the beat. Conversely, when the moon orbit is more face on to the pulsar 
(stage 2 and 4 in figure 6.4), the moon moves from being (a 2 + (M p /(M m + 
M p )) 2 a^ + 2a p (M p / (M m + M p ))a m cos I m ) 1 ^ 2 away from the pulsar to being 
(a 2 + (M p /(M m + M p )) 2 a 2 m - 2a p (M p /(M m + M p ))a m cos I m ) l l 2 away from 
the pulsar while the planet moves from being (a 2 + (M m / (M m + M p )) 2 Om + 
2a p (M m /(M m + M p ))a m cos Im) 1 / 2 away from the pulsar to being (a 2 + 
(M m / (M m + M p )) 2 a 2 m - 2a p (M m /(M m + M p ))a m cosI m ) 1/2 away from the 
pulsar during one moon orbit. As these values are the most different from 
those for circular coplanar orbits, it should be unsurprising that it is at these 
times that the perturbation to the circular coplanar signal is greatest. We 
now explore the effect of arbitrary mutual inclination. 



6.6.4 Solution in the case of circular orbits and arbitrary 
mutual inclination 

For the case of arbitrary mutual inclination we no longer neglect the higher 



order terms in sin/ m . Taking equation (16. 341), expanding the sum, collecting 
the complex exponentials into sine and cosine functions and substituting 
in the expressions for ^imm'(Im), where we note that that ^yi mm '(I m ) = 



(-l) m + m 'r ri _ m _ mf (I m ) (see Appendix[c]), gives 

rt ft' „2 



1 GM m M p [* f r 2 m ( 3 2 
TOA pert , pm = --— —J q J q ^(-^cos I m -l)e rp 
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- — (1 + cos/ m ) cos(2/ m + 2w m - 2f p - 2w p )e rp 
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--(1 + cosI m ) 2 sin(2/ m + 2w m - 2f p - 2w p )e^ p 
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+ - sin/ m (l + cos Im) sin(2/ m + 2uj m — fp — ^p ~\~ Qm)&6 p 
3 

+ - sin/ m (l - cos/ m ) sin(2/ m + 2w m + f p + w p - Q. m )e ep 

-- sin 2 I m cos(2/ p + 2w p - 20 m )e r 
o 
3 

+ - sin 2 I m sin(2/ p + 2w p - 2Q m )e^ 
o 
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- (1 - cos I m ) 2 cos(2/ m + 2uj m + 2f p + 2w p - 2Q m )e rp 
9 

-- sin I m cos(2/ m + 2w m )e r 
o 

3 

+ -(1 - cos/ m ) 2 sin(2/ m + 2u m + 2f p + 2w v - 2Vt m )e^ p 

+ | sin(2J m ) sin(/ p + tu v - O m )e 0p ^ • ndt"dt'. (6.39) 



Substituting in equation (6.25) and collecting like coefficients gives 



1 GM m M p /•* y 1 
Peri ' pm " cM m + M p J J 



1 m 
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--(3cos 2 / m - l)sinI p cos(/p + ro p ) 
V ° 
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+ 3 ^( 1 + cos/ m ) 2 sin/ p cos(2/ m + 2w m - 3/ p - 3ro p ) 

21 , ,9 

sin ip cos(2/ m + 2ro m - / p - ro p ) 



21, X 9 

"32 (1 + C ° rn) 

3 

-- sin J TO (1 - cos I m ) cos I p sin(2/ m + : 
3 

-~ sin I m (l - cos I m ) cos I p sin(2/ m + 2w m + f p + w p - Q m ) 



■ 2iO m f p W p + f^m) 



, — 20 



15 

- — sin 2 I m sin I p cos(/ p + w p 
3 

-— sin 2 J m sin/p cos(3/p + 3ct p - 2Vt m ) 
9 

- — sin I m sin I p cos(2/ m + 2w m + / p + w p ) 
' 2 ' : " cos(2/ m + 2uj m - f P - w p ) 



y • 2 r • r 

-— sm J m smJ p 



15 n r 

-T^l 1 - C0Si m/ 



AJ . ^wo im ) 2 sin/pcos(2/ m + 2o; m + / p + ro p - 20 m ) 
3 

- — (1 - cos/ m ) 2 sin/pCos(2/ m + 2w m + 3/ p + 3w p - 2fi m ) 

3 \ 
— - sin(2J m ) cos I p sin(/p + uj p — fi m ) J di'cfi. (6.40) 



As both orbits are circular, r m = a m , r p = a p , f m = n m t + f m (0) and 
fp = n p t + f p (0). Using these simplifications and performing the integrations 
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gives 

1 GM m M p a 2 m 
c M m + M v at 



m ~ lv± p u p 
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1"7777 77; ZT^9. cos ( 2 /m + 2w m ~ Jp ~ ^p 
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+ t 77^ N2 sm(2/ OT + 2w m - f p - zu p + il m ) 

4 [Zn m Tip) 



3 sin J m (l - cos J m ) cos I p . 
+ t 773 — 72 sin(2/ m + 2u m + fp + Wp- \l m ) 

4 ^Z71. m + Tip 



15sin 2 / m sinl p 
+ 2 cos \fp + ^p~ 2i2 m ) 

3 sin 2 / m sin/ p , . 
16 (3n ) 2 cos Wp + 3ro P ~ 2 ' 2m ) 

9 sin 2 I m sin I p 

+ 16 (2^T^F cos(2/m + 2 " m + /p + Wp) 

9 sin 2 / m sin/ p 

cos(2/ m + 2w m - /p - tup) 



16 (2n m - ?ip) 
15 (1 — cos/ m ) 2 sin/„ 

+ 777; 775 T 72 cos ( 2 /m + 2w m + Jp + - 2i2 m ) 

oZ yZfi rn -\- Tip j 



3 (1 — cos Im) sin /„ 

+7777 — 77; —7, — 7^5 — cos(2/ m + 2w m + 3/p + 3tUp - 2il r 

32 (2n m + 3np)^ 



3 sin(2/ m ) cos I v ,, ^ A .„ 
+7 2 - sin(/ p + w p - fi m ) . (6.41) 

4 n p y 

Consequently, higher values of inclination modify the amplitude of the sinu- 
soids with frequency 2n m — n p and 2n m — 3n p (the frequencies corresponding 
to the circular coplanar case) , with frequency 2n m + n p (the frequency corre- 
sponding to the slightly inclined case) and introduce sinusoids of frequency 
3n p and 2n m + 3n p . 

In addition to adding new frequencies to the perturbation, high mu- 
tual inclination can also change the form of the perturbation (compare fig- 
ures 6.3 and 6.5). In particular, mutual inclination is capable of changing 
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Figure 6.5: Functional form and frequency composition of the time-of-arrival 
perturbation due to planet-moon binarity for the case where the planet and 
moon's orbits are circular and the orbital planes of the planet and moon are 
perpendicular (I m = tt/2). These curves were calculated for the case of a 
PSR B1620-26 b analog, in particular, it was assumed that M p = 2.3Mj, 
0.2Mj and a m = 0.8AU. 



a p = 23AU, M, 



TOA per t tP m from something that looks like a sinusoid to something that 
looks like a beat (see figure 6.5). As can be seen, while mutual inclination 
does not substantially alter the maximum amplitude of the perturbation 
over a full planetary orbital period, it can reduce the amplitude for lengths 
of time much smaller than an orbital period, where we recall that the period 
of the envelope function is half a planetary orbital period. Consequently, 
taking mutual inclination into account is very important for placing limits 
on moons of pulsar planets such PSR B1620-26 b which have orbital periods 
much longer than length of time over which they have been observed. Now 
that the effect of mutual inclination on the time-of-arrival perturbation has 
been investigated, we move on to investigate the effect of eccentricity, in 
particular, the effect of eccentricity in the moon's orbit. 
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6.7 Slightly eccentric moon orbits 



While most moons are expected to form on circular orbits (see chapter [3]), 
captured moons begin with very elliptical orbits. In addition, even for the 
case of a moon on an initially circular orbit, subsequent orbital evolution 
can increase the orbit's eccentricity (e.g. Hut, 1981). Consequently, it is 
of interest to investigate the case where the moon's orbit is eccentric. In 
this section the case of slightly eccentric moon orbits will be investigated 
by considering the expression for the time-of-arrival perturbation correct 
to first order in 6 m . This case was selected as first, it indicates the types 
of effects eccentricity in the moon orbit can have on the perturbation and 
second as it is substantially simpler than the general case. 

For the case where the planet and moon's orbits are coplanar, but the 
moon's orbit is eccentric, 8 P = n/2, ip p = f p + oo p + f2 p , m = tt/2 and 
"0m = fm + w m + ^m- However, as the moon's orbit is eccentric, terms 
involving r m and f m are no longer simple functions of time. Using these 
expressions, equation (6.17) becomes 



TOA 



pert,pm 



1 GM m M p 
cM m + M p 



t r-t' 

o Jo 



v m=-2,2 



(2-m)! 
(2 + m)\ 



m imj„ 
2 



(P 2 m (0)) 2 [-3e rp - ime^ p ] ■ ndfdt', (6.42) 



where the terms corresponding to the moon's orbit have been grouped into 
one factor using square brackets. Applying the expansion presented in equa- 



tion (6.10) to the term in square brackets, where we note that the Fourier 
coefficients Sn m \e m ) are given to order e 2 m in appendix |d[ gives 



1 GM m M p A* ' ^ (2-m)\ 2 al 



r-t rt' I 2 00 
4 2m) (em)e inMm( ' } x e'K"" 1 ^) [-3e rp - ime^ p ] \ ■ ndt'dt. (6.43) 



This equation describes the time-of-arrival perturbation for the case of ec- 
centric moon orbits. 

In the case of low eccentricity, the terms of order and above can be 
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neglected. Truncating the expansions in table 



D.l 



to order e m gives 



.(22) 
'2 

,(22) 



(cm) — 3e m , 



,(22) 
'3 

,(20) 
'0 

,(20) 



(e m ) 
(^m) 



$1 (Cm) 



(6.44) 
(6.45) 
(6.46) 
(6.47) 
(6.48) 



Only including the above terms in equation (6.43), noting that s 
s^ n m ^*, and combining the complex exponentials into sinusoids gives 



(Zm) 



TOA 



pert,pm 



1 GMrnMp [* I* al 
cM m + M p J J rl 
9 



3 



4 
6 

~4 

96m, 

"T" 

6e m 



cos(2M m (t) + 2ro m - 2/ p - 2w p )e rp 



sm(2M m (t) + 2w m 
cos(3M m (t) + 2w Tl 
sin(3M m (i) + 2w n 



4 

H — ^ cos(M m (t) + 2ro r , 
+ ^^sin(M m (t) + 

36 r , 



2/p - 2wp)e-,p p 
^/p 2rop)e r;) 

- 2/p - 2n7p)e^, p 
2/p 2n7p)e rj) 

- 2/p - 2tUp)e^ p 
cos(M m (t))e rp ) • ndi'di. (6.49) 



Substituting in equation (6.25) and expanding the trigonometric products 
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gives 



1 GM m M„ . , f* ft' al ( 3 



15 

-— cos(2M m (f) + 2ro m - / p - ro p ) 
3 

-- cos(2M m (£) + 2w m - 3/ p - 3zz7 p ) 
o 

15e 

^ cos(3M m (t) + 2ro m - f p - w p ) 

3e 

cos(3M m (i) + 2ro m - 3/p - 3ro p ) 

o 

45e 

H — cos(M m (i) + 2w m - f p - Wp) 
9e 

+ ^ sin(M m (i) + 2w m - 3/ p - 3tup) 
3e 

H — ^ cos(M m (i) - /p - rop) 
3e 

+ — - cos(M m (i) + f p + wp) ) cft'dt (6.50) 



Noting that r p = a p , ^ = n p and dM ™(^ = ri m , performing the integrals 
and simplifying then gives 

1 GM m M p . T a 2 m f 3 
TOA pert , pm = - c M-^- p ^I P ^ - S (/ P + wp) 

15 

+ «?9 Z ^2 cos ( 2M m{t) + 2w m ~ fp~ W p ) 

o{Zn m rip) 
3 

+ aro q — ^2 cos ( 2M m(t) + 2w m - 3/p - 3w p ) 

o(Zn m orip) 



15e^ 
8(3n m - rip) 2 



+ ^7^ ~ \ \2 cos(3M m (i) + 2w m - f p - w p ) 



+ 36m ^ ^ cos(3M m (i) + 2ro m - 3/p - 3wp) 



8(3n m — 3n p ) 5 
45e m 
8(n m - rip) 2 

9 Cm 



cos(M m (t) + 2w m - f p - w p ) 



^ sin(M m (t) + 2w m - 3/p - 3w p ) 
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3e r , 



4(n m - n p ) 

4(n m + n p 



2 COs(M m (t) - / p - tJ7 p ) 

2 cos(M m (t) + fp + Tu p ) ) . (6.51) 



As seen by comparing equation (6.51) with equation (6.29), the pertur 



bation in the case of circular coplanar orbits, the effect of a small amount 
of eccentricity in the moon's orbit, is to split each of the frequencies corre- 
sponding to the circular coplanar case into three frequencies (see figure 6 



The origin of this splitting can be seen by considering that the only non-zero 
coefficients to appear in the low eccentricity case correspond to terms with 
n = m + 1, morm-1. For orbits which are eccentric enough such that 
the assumption of low eccentricity is no longer valid, more coefficients will 
be relevant, and thus more than three frequencies will be produced by the 
splitting. 

As for the case of mutually inclined orbits these additional frequencies 
act to modify the form of TOAp ertj p m (see figure 6.6). Again considering 



the beat analogy and attributing the envelope function to the motion of the 
planet-moon pair about the pulsar and the high frequency "sinusoid" which 
the envelope function modifies to the motion of the planet and moon about 
their common barycenter, we expect that eccentricity in the moon's orbit 



would modify this "sinusoid". From figure 6.6 we can see that this is truly 
the case. We now move on to investigating the effect of eccentricity in the 
planet's orbit. 



6.7.1 Slightly eccentric planet orbits 



An investigation of the effect of eccentricity in the planet's orbit is scien- 
tifically interesting for two main reasons. First, while planets in the Solar 
System have orbits that are well approximated by circles, many extra-solar 
planets do not, for example, nearly half of the planets presented in the 
extrasolar planet encyclopedia have eccentricities larger than 0.1. In addi- 
tion, for the particular case of pulsar planets, the two outer planets in the 
PSR B1257+12 system have orbits with low, but non-zero eccentricities of 
0.0186 and 0.0252, and it is thought that the orbit of PSR B1620-26 b is ec- 



centric (Ford et al. 2000 Siggurdsson & Thorsett, 2005). Second, recall from 



section 5.1, that for the case of circular coplanar orbits, the amplitude of 

\2l 



TOA 



pert,pm 



is approximately sin I p [9(M p M m ) / 16(M p + M„ 



i/r p ] times 



the system crossing time r p /c. As a result of the dependance on r p , it can 
be intuitively seen that variation in r p over a full planetary orbit is likely 
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Figure 6.6: Functional form and frequency composition of the time-of-arrival 
perturbation due to planet-moon binarity for the case where the planet and 
moon's orbits are coplanar, the planet's orbit is circular, and the moon's 
orbit is eccentric (e m = 0.05). These curves were calculated for the case of 
a PSR B1620-26 b analog, in particular, it was assumed that M p = 2.3Mj, 
a p = 23AU, M m = 0.2Mj and a m = 0.8AU. 



to have a marked effect on the size and structure of the perturbing signal. 
As for the case of moon eccentricity, we investigate the effect of planetary 
eccentricity in the low eccentricity regime by deriving an expression for the 
time-of-arrival perturbation due to planet-moon binarity correct to first or- 
der in e p . 

Assuming the moon's orbit is circular and coplanar with that of the 



planet, equation (6.17) becomes 



TOA 



pert,pm 



rt rt' 
cM m + M p J Ju 



1 GM m M p 



(2-m)! 




xe i(mf m +m^ m -m^ p )^n (p™( ))2 [_ 3e ^ _ ime ^ j . n dt" dt' , (6.52) 
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where 



n = sin I p cos(/ p + w p )e rp - cos I p e dp - sin I p sin(/ p + w p )e ( j )p . (6.53) 



It can be seen that in the case of eccentric planet orbits some extra work 
must be done before progress can be made. This is because f p is no longer 
a linear function of time. Consequently, the cos(/ p + w p ) and sin(/ p + w p ) 
terms within n must also be included in the Fourier expansion. For the 
cos(/ p + w p ) term we have 



n 4 1 a 4 1 o 4 

_P e -imf p ^ a ff ^ _ J^ P tZP p -i(m-l)fp + e ~i™ P ± > p -i(m+l)f p 



cos(f p + w p ) = e l ™*--^e 



which can be written as 



2 r 4 
v 



.54) 



1 oo 

e~ imf ' cos(/ p + w p ) = e iw *- F^ 7n ~ 1 \e p )e- inM ^ 

n=— oo 

1 oo 

+ e -^ P _ ^ 4 3 ' m+1 )(e p )e- inM f W . (6.55) 



Similarly for the sin(/ p + w p ) term we have that 



4 -, oo 

f e" im ^ sin(/ p + w p ) = e iro p- ^4 3 ' m - 1 )(e p )e- iriM f W 

1 oo 

_ e i^ P F^' m+1 ^ (e p )e- inM " {t) , (6.56) 



n=0 



where it is now written in terms of F^' m ^ and i?( 3 > m + 1 ). 3 



Using equations (6.55) and (6.56), equation (6.52) can now be written 



3 Both i^ 3 '™- 1 ) and F (3 ' m+1) can correspond to parent spherical harmonics where the 
absolute value of "m" is larger than /. As Y™ = for \m\ > I (see equation ( 6.4 1) the term 
Y^pd.m) in equation ( |6l7| is equa i to zero independent of the value of F c " As F (l ' m) 
are Fourier coefficients of functions of the form (a l p +1 /r l p +1 )e~ tm ^ p , there is no physical 
limit on the values of I and m. So, while coefficients of the form i^ 3,4 ' and ,p' 3 ' _4 ) have 
meaning, they are not naturally occurring in the expansion of the disturbing function as 
usually they would be premultiplied by zero. 



105 



as 



TOA 



pert,pm 



1 GM m M p . 

- — T-r- sin i„ 

c M m + M p v 



t ft' 
o Jo 



-imixj v ' m c »(m./m+mCT ro ) (P™(0)) 2 



E 

, m=-2,2 
oo 



(2 - m)! 
(2 + m)! 



3 + m 



■inM p (t) 



dt'dt. (6.57) 



This is the governing equation for eccentric planet orbits. In the limit of low 
eccentricity, terms of order e p and above can be safely neglected. The only 
terms in table D.2 which are non-zero once the e p terms have been neglected 
are: 



F^>(e p ) = -e p , (6.58) 

F 3 (33) (e p ) = l, (6.59) 

F 4 (33) (e p ) = 5e p , (6.60) 

F 2 (32) (e p ) = l, (6.61) 

F 3 (32) (e p ) =4e p , (6.62) 

F (31) (e p ) =e p , (6.63) 

F 1 (31) (e p ) = l, (6.64) 

F 2 (31) (e p ) = 2e p , (6.65) 

if 0) (e p ) = l, (6.66) 

F[ m \e p ) = 2e p . (6.67) 



Expanding equation (6.57), only retaining these terms and combining 
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them into sinusoids gives 

tha l GM m M p smI p [* [*' r 2 m ( 3 v 
TOA pert<pm = * — / / -j I - - cos (M p (t) + ro p 

3 15 

-- cos(2/ m + 2tu m - M p (t) - ro p ) - — cos(2/ m + 2ro m - 3M p (t) - 3u p ) 

o o 

3e 9e 
— / cos(2/ m + 2ro m - rop) cos(2/ m + 2tu m - 2M p (i) - tu p ) 

o 

15e 9e 
+— ^ cos(2/ m + w m - 2M p (t) - 3w p ) - — 2 cos(2M p (t) + u p ) 
o 4 

75e 3e \ 

-5 cos(2/ m + 2tu m - 4M p (i) - 3ro p ) - — £ cos(ot p ) (it' (it. (6.68) 

8 4 y 

Noting that r m = a m and that f m = ra m t + / m (0) and M p {t) = n p t + M p (0), 
and performing the double integration gives 

m ^ , 1 GM m M v sin L, rL ( 3 , „ , , s 

row. = -- Mm+ V, t| (s? cos(m * w + 

15 

+ a7o q — ^2 cos ( 2 /m + 2tu ™ - 3Mp(t) - 3wp) 

o \ zn m on p ) 

3 

- u ~2 cos(2/ m + 2w m - M p (t) - w p ) 



8(2n m - n p ) 
9e 

+ ^ _% , 2 cos(2/ m + 2ro m - 2Mp(t) - rop) 

15e 

cos(2/ m + w m - 2M p (t) - 3w p ) 



8(2n m - 2n p ) 2 
75e 

+ ^ cos ( 2 /m + 2ro m - 4M p (t) - 3w p ) 

9e 3e \ 

+ ,, % cos(2M p (t) + w p ) + p cos(2/ m + 2w m -w p )\, (6.69) 

where the constant term has been neglected. As can be seen from equa- 
tion (6.69), eccentricity in the planet's orbit leads to the inclusion of terms 
with frequency 2n m , 2n m — 2n p and 2n m — 4n p in the perturbation. 

The effect of these extra frequencies is to modify the shape of the per- 
turbation. In particular eccentricity in the planet's orbit modulates the 



4 Eccentricity in the planet's orbit also leads to the inclusion terms with frequency n v 
and 2n p in the perturbation. However, as the planet is on an eccentric orbit, TOA per t,p 
already contains terms with these frequencies. Consequently these terms will be unde- 
tectable as a separate signal. 
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Figure 6.7: Functional form and frequency composition of the time-of-arrival 
perturbation due to planet-moon binarity for the case where the planet and 
moon's orbits are coplanar, the moon's orbit is circular, and the planet's 
orbit is eccentric (e p = 0.1). These curves were calculated for the case of a 



PSR B1620-26 b analog, in particular, it was assumed that M p 
M m = 0.2Mj and a m = 0.8AU. 



2.3Mj, 



a p = 23AU 



envelope function over a planetary period (see figure 6.7). This result is 
physically sensible in that r p is periodic over a planetary orbital period. Re- 
calling that for the circular-coplanar case, the timing perturbation due to 
planet-moon binarity was proportional to l/r~, we would expect TOA. per ^^ pm 
to be large at periastron (when r p is small), and small at apastron (when 



r p is large). Noticing that the orbit shown in figure 6.7 is at periastron at 
t = 0, this is exactly what is obtained. Such an understanding is practically 
useful as it indicates that for the case of a pulsar planet on an eccentric orbit, 
observations aimed at detecting if it has a moon should be scheduled during 
or near periastron where it is expected that the perturbation is largest. 
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Figure 6.8: Diagram showing the effect on the time-of-arrival perturbation 
frequencies due to constant viewing angle and symmetry breaking processes 
such as mutual inclination and eccentricity for a) circular coplanar orbits b) 
orbits with low mutual inclination c) moon orbits with low eccentricity and 
d) planet orbits with low eccentricity. The splitting induced by inclination 
or eccentricity is shown in the gray box, while the splitting due to constant 
viewing angle is shown in the dashed box. In the case of an elliptical planet 
orbit, the splitting from both of these sources are inseparable and are conse- 
quently shown in a grey box with a dashed border. The initial and resulting 
frequencies are labeled, except in the case where neighbouring frequencies 
differ by n p . In this case, the lower and upper limits are given, separated 
by a vertical line. Finally, the fundamental frequency of the zeroth order 
time-of-arrival signal is denoted by a bold line. 
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6.8 Conclusion 



Expressions for the timing perturbation due to planet-moon binarity have 
been derived using a three-body formalism developed by my PhD supervi- 
sor, Dr. Rosemary Mardling. Using this formalism, the cases where the 
planet and moon's orbit were circular and coplanar, circular and mutually 
inclined, coplanar with an eccentric moon orbit and coplanar with an eccen- 
tric planet orbit were investigated. For the case of circular coplanar orbits, 
the expressions derived using this more general method exactly matched 
those produced in chapter [5} Then, building on this analysis, the cases of 
mutually inclined planet and moon orbits and slightly eccentric planet and 
moon orbits were investigated. For the case of mutually inclined orbits, it 
was found that slight misalignment resulted in additional terms with fre- 
quency 2n m — n p and 2n m + n p , and with amplitude proportional to the 
degree of the misalignment being added to the base circular coplanar sig- 
nal form, while larger values of mutual inclination altered the perturbation 
from something which looked like a sinusoid to something that looked like 
a beat function (compare figures 6.3 and 6.5). In addition, for the case of 
slightly eccentric orbits, it was found that, as for the case of slight mutual 
inclination, the expression for the perturbation was given by the sum of 
the perturbation for the case of circular coplanar orbits and a perturbation 
term proportional to the relevant eccentricity. In particular, for the case of 
slightly eccentric moon orbits the perturbation term contained sinusoids of 
frequency 3n m Tip, 3n m 3Tip, 2Ti m Tip, 2n m 3?2p, n m -\- np, n m Tip 
and n m — 3n p , while for the case of slightly eccentric planet orbits the per- 
turbation term contained sinusoids of frequency 2n m , 2n m — n p , 2n m — 2n p , 
2n m — 3n p and 2n m — 4n p . From a more qualitative perspective these ad- 
ditional frequencies resulted in a change in the shape of the high frequency 
oscillations in TOA per t tPm for the case of eccentricity in the moon's orbit, 
and a modulation of the envelope function of TOAp er t tPm over an plane- 
tary orbital period for the case of eccentricity in the planet's orbit. These 



results are summarised in figure 6.8 In line with the motivation of this 



chapter, these expressions, along with the transparent way in which they 
were derived, allow an understanding of the physical origin of form of the 
perturbation signal as a function of the orbital elements of the moon's orbit 
and allow this method to be extended to include pulsar planets on inclined 
or eccentric orbits. Now that moon detection around pulsar planets has been 
investigated, used to place limits on moons of a real pulsar planet and ex- 
tended, we shift our focus to the second moon detection technique analysed 
in this thesis, photometric transit timing. 
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Part III 



Detecting Moons of 
Transiting Planets 



in 



Chapter 7 

Introduction 



7.1 Introduction 



Before investigating which moons of transiting planets are detectable using 
the photometric transit timing technique (TTV P ), it would be instructive 
to summarize the mathematics and main results associated with the transit 
technique, detection of moons of transiting planets in general, and the TTV P 
technique in particular. This will be done in three main stages. First, 
pertinent results from the transiting planet literature will be summarised, 
in particular, the way in which the transit duration and the shape of the 
transit light curve depend on the system parameters will be discussed and 
the corresponding formulae for these quantities introduced. Second, the set 
of methods proposed in the literature to find moons of transiting planets 
using the transit light curves will be reviewed. Finally, the TTV P technique 
will be focussed on, with the aim of summarising the results presented in 
previously published work, defining where my work fits in that context, and 
also providing a more mathematically useful description of At, the TTV P 
test statistic. We begin with a discussion of the transit technique. 



7.2 Description of the transit technique 



The transit technique is a planetary detection technique where the presence 
of the planet is deduced by the dip in received intensity of its host star 
as the planet passes in front of it. This technique was first proposed by 



Struve (1952), who used images taken using photographic plates to search 
for transiting planets. While the technique was periodically revisited (e.g. 



Rosenblatt, 1971 Borucki & Summers, 1984 Borucki et al. 1985), it wasn't 
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Time 



Figure 7.1: Diagram showing the different portions of the transit light curve. 
The four shaded circles show the planet's position across the face of the star 
at the beginning and end of ingress, and the beginning and end of egress. 
As the position of the planet along the chord of the star is a linear function 
of time, it can be used as a proxy for time. Consequently the position of 
the planet and the value of the light curve resulting from that position are 
linked by dashed lines. 



until the advent of CCD technology which made wide-field surveys plausible, 
that the disadvantages of this technique started to be outweighed by its 
advantages (Kjeldsen & Frandsen, 1992), and could start to produce results. 

This method's main disadvantage is that in order to be detected, the 
orbit of the planet must be such that it passes in front of its parent star. As 



the probability of a given planet transiting is oc a„ 1 (Borucki & Summers 



1984 Barnes, 2007), where a p is the semi-major axis of the planet's orbit, 



it can be seen that a given planet is more likely to transit, and thus to be 
discovered, if its semi-major axis is small than if its semi-major is large. 
However, as the transit technique uses the star's total intensity, as opposed 
to the radial velocity technique, where the light must be split up to give 
high resolution, high signal-to-noise spectra, it can be seen that fainter stars 
can be targeted. Consequently, the disadvantages of this technique as a 
result of selection effects can be partially rectified as many hundreds or even 
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thousands of stars can be monitored at once. 

With the advent of wide field CCD surveys, this planet detection tech- 
nique has come of age with over 100 planets discovered using this technique. 1 
Not only does the transit technique allow for planetary detection, it also al- 
lows for the measurement of planetary radius, orbital inclination as well as 
observables such as orbital orientation relative to the star's spin axis (e.g. 



Queloz et al. 



2000 



2007) and even the presence or absence of moons (Sartoretti Sz Schneider 



1999 



Narita et al. , 2007), planetary oblateness (Hui & Seager 



2002| [Barnes fc Fortney 2003), atmospheric composition (e.g. |Charbonneau 



eTaL] [20021 |Vidal-Madjar et al.[ [20041 |Richardson et al.j |2007| |Tinetti et aT 



As with the concept, the mathematical techniques required to analyse 



these light curves are well established in the literature (Gimenez 2006). 



This is because the light curves from transiting planets are related to the 
light curves of eclipsing binary stars, which have been extensively studied 



(e.g. Kopal 1979). An example transit light curve is shown in figure 7.1 



The duration and the shape of this light curve depend on the shape and 
inclination of the planet's orbit, the relative sizes of the planet and the star 
and the degree of limb darkening exhibited by the star. To aide in further 
derivations, the effect of these variables on the duration and shape of a 
transit will be summarised. 

To begin the investigation on transit duration, we recall from chapter [2| 
that for this thesis, the transit duration is defined as the time between the 
center of ingress (ij n ,p) arid the center of egress (t eg , P )- That is, 



-tra 



t, 



t 



(7.1) 



As the position of a given planet is generally written in terms of the true 
anomaly, f p , as opposed to the time, t, an expression will be constructed 



for the transit duration in terms of f p . Following Kipping (2008) and using 



Kepler's second law (see equations (2.10) and (2.26) of Murray Sz Dermott 



(1999)), we have that 



dt 



Tlp&p 



1 



-dfp, 



(7.2) 



where t is time, r p is the distance between the planet and the star, a p is the 
semi-major axis of the orbit, e p is the eccentricity of the orbit and f p is the 



1 See, for example, http://exoplanet.eu/catalogue.php. 
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true anomaly. When integrated between ti np and t eqp , this equation gives 



m,p allu u eg,pi 

fit-- I r -^=df P , (7.3) 



fin, P n p a*Jl — 



r feg,p j>2 

^eg,p tin,p — / / = df p , C^.4) 

J fin, P n p a^Jl — e| 



thus 



- tra 



dfp. (7.5) 



' }in,p n p a p\/{^ e pj 

So, in order to determine the transit duration, fi n<p and f e g,p, the values of 
f p corresponding to the middle of ingress and middle of egress, are required. 

To obtain these values we begin by considering a keplerian planet orbit 
given by 

a p (l — el) 

rp(t) = 1 + Cp COB /„(*)' (7 - 6) 

where r p is the distance between the planet and the star, a p is the semi- 
major axis of the orbit, e p is the eccentricity of the orbit, and f p is the 
true anomaly. Rewriting this expression using Cartesian coordinates and 
rotating this orbit by the three Euler angles (see figure 7.2) gives 

Xp = r p cos fl p cos(/ p + Up) — r p sin £l p cos I p sin(/ p + U) p ), (7.7) 
y p = r p sin Q p cos(/ p + u) p ) + r p cos £l p cos I p sm(f p + uj p ), (7.8) 

where the three Euler angles, Q p , oj p and I p , represent the longitude of the 
ascending node, the argument of periastron and the inclination, respectively. 

Now, the center of transit ingress and the center of transit egress occur 
when the center of the silhouette of the planet and just touches the limb of 
the star, that is, when the center of the planet is R s from the center of the 
star, where R s is the radius of the star. Mathematically this occurs when 

R 2 s =x 2 p + y 2 p . (7.9) 



Substituting in equation (7.7) and ( |7.8| for x p and y p and simplifying gives 

R 2 S = r 2 p [cos 2 (f p + lj p ) + cos 2 I p sm 2 (f p + u p )]. (7.10) 

This equation describes the values of f p corresponding to the beginning and 
end of the primary transit. 2 In addition, we have that f p depends on R s , 



2 This equation may also describe the beginning and end of the secondary transit as 



well. 
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Figure 7.2: Schematic diagram of the method used to describe an orbit of 
arbitrary orientation. The x'", y'" , z" 1 coordinate frame is fixed to the orbit, 
such that the pericenter points along the positive x'" axis, and the orbit 
lies in the x"'-y"' plane. This orbit is then rotated sequentially through the 
three Euler angles, u, I and ft to give a description of the orbit in the x, y, z 
coordinate frame. This unprimed coordinate system describes the orbit with 
respect to an inertial reference frame. In particular, for this application, the 
x and y coordinate axes lie in the plane of the sky, while the z-axis point 
along the line-of-sight. 



r p , uj p and I p , but not on Q p , as it does not appear in this equation. This 
is reasonable as altering f2 p only alters the orientation of the path taken by 
the planet on the face of the star, and not the intensity along it. 



Continuing, to determine the transit duration, equation (7.10) needs to 



be solved in terms of f p for the times of ingress and egress. As this is a high 
order equation in f p , this is not trivial (see Kipping (2008) for a derivation 
of general expressions for Tt ra ). For this thesis, the full general expression 
is not required, so, we will look at three specific cases, partially to highlight 
the physics and partially as expressions for these quantities will be required 
in later chapters. These cases correspond to cases where the planet's orbit 
is circular and aligned to the line-of-sight, is circular, but slightly inclined to 
the line-of-sight and eccentric and aligned to the line-of-sight. Expressions 
for fi njP and f e9tP and thus Tt ra will be derived in turn for these three cases. 

For the case where the planet's orbit is circular and aligned to the line- 
of-sight, we have that r p = a p and l 
becomes 

' yl - a 2 [cos 2 (f p + u p 



tt/2. Consequently, equation (7.10) 



(7.11) 



and thus 



R s = ±a p cos(/ p + i 



'vi- 



(7.12) 



where, assuming n p , the mean motion, is positive, the plus and minus rep- 



resent the egress and ingress respectively. Rearranging equation (7.12) to 
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give an explicit expression for f p gives 



cos 



R, 



CO. 



p, 



± (sin — 
\ a pJ 



OJ 



(7.13) 
(7.14) 



where we have used the identity that sin(^4 ± tt/2) = ±cos(^4) and we have 
kept the solutions relevant to the primary transit. 

For the case of the planets of interest, we have that R s /a P € 1 as only 
distant planets are likely to keep their moons (Barnes & O'Brien, 2002). To 
first order in R s /a p , equation (7.14) becomes 



fp 



* ± Rs 
2 a„ 



OJ 



V 



Substituting this expression into equation (7.5), setting r p 
= 0, we have that 



(7.15) 



a p and 



Ttra 



o p n p a p 



;df p , 



> _1_ «s , . 



df P 



2R S 

ClpTlp 



(7.16) 

(7.17) 
(7.18) 



This is exactly the result that one would expect. Consider the numerator and 
the denominator of equation (7.18). The numerator is exactly the distance 
that the planet must travel to cross from one side of the star to the other, 
while the denominator is the velocity of a planet on a circular orbit. 

For the case where the planet's orbit is still circular, but slightly inclined, 
we have that r p = a p , I p ^ ir/2 and e p = 0. Thus equation (7.10) becomes 



R 2 S = Op[cos 2 (/ p + Up) + cos 2 Ip sin 2 (/ p + u> p )]. 
Using the identity that sin 2 A + cos 2 A = 1 we have that 



R 2 S — a p cos 2 I p 



i^sm 2 I p cos 2 (f p + u p ), 



which simplifies to 



R 2 — ai cos 2 Ip = ±a p sin I p cos(/ p + ui p ). 



(7.19) 
(7.20) 

(7.21) 
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Following the same method as used above, this can be written as 



cos 



-1 



7T 



± sin 



-i 



R? s — a 2 cos 2 I p 
a p sin I p 



' R 2 S — alp- cos 2 Ip 
a p sin I p 



UJ p , 



OJr. 



(7.22) 
(7.23) 



Noting that cos/ p is of order R s /a p , s and again only retaining terms up to 
first order in R s /a p , we have that 



fv 



7T 



\j (R s /a P , 



2 — COS 2 Ip — LOp. 



(7.24) 



Again conducting the integral we obtain 



2JR 2 S - a 2 cos 2 Ip 



-tra 



a n n 



(7.25) 



pi up 



Again, this is what we would expect. The distance travelled by the planet 
is exactly given by the numerator while the velocity is given by the denom- 



inator. In addition, from equation (7.25) we can see that the reduction in 



the transit duration with the increase in \I P — ir/2\, results from the shorter 
length chord over which the planet transits, and not a change in velocity. 
Finally, for the case where the orbit is aligned to the line-of-sight, but 



eccentric, we have that I p = tt/2 but that r p is given by equation (7.6). 



Consequently equation ( 7.10| ) becomes 



R,Z 



a p (l 



1 + e p cos f p (t) 



COS 2 (f p + UJp) 



(7.26) 



which simplifies to 



R s 



cos(/ p + u : 



1 + e p cos f p (t) 



!->)■ 



(7.27) 



As in the previous sections, we would like an expression for f p correct to 
first order in R s /a p , but in this case we will use a perturbation expansion 



3 Recall that cos/ p = S m in/a P where 8 m i n is the impact parameter. In addition, in 
order for the planet to transit, we must have that <5 m i„ < R s + R P ~ Rs- Thus cos/ p is of 
order R s /a p . 
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to obtain it. To simplify the expression we multiply both sides by (1 + 
e p cos f p (t)), and to ensure that the small term, R s /a p , is clearly identified 
we divide by a p , giving 



R.i 



(1 + e p cosf p (t)) = ±(1 - 4)cos(/ p + u) p ). 



We begin by writing f p as a perturbation expansion 

fp = fp,0 + tfp,l + ••• 



(7.28) 



(7.29) 



first order equations are 



where the small parameter e is equal to R s /a p . Substituting into equa- 
tion (7.28) and grouping terms of like orders we find that the zeroth and 

= ±(l- e 2)cos(/ p , + ^ P ), (7.30) 

(7.31) 



and 



1 + e p cos/ Pi o = ±(1 - e p )(-/ p ,i sin(/ p>0 + uj p )). 
The zeroth order equation can be solved to give 

f - n 

Jp,0 — 7T ~~ U P- 



(7.32) 



This result is unsurprising as it agrees with the expressions for f p for the two 
previous cases to zeroth order (see equations (7.15) and (7.24)). Substituting 

(7.33) 



this into equation (7.31) gives 

fp,i 

Thus, 



1 + e.„ sin qj 



I -el 



tt R s 1 + e p sin uj 
2 o p 



pi 



(7.34) 



to first order in R s /a p . 

Consider the equation for the transit duration, 



-tra 



/p,0+e/p,: 



1 



/j>,0— 71 D a„ A /l 



p v V * "p 

/p,0+e/p,l 



1 + e p cos f p (t) 



n p \/l — Jfp,o-ef Pl i 



l-e 2 
1 + e p cos f p (t) 



dfp 



(7.35) 
(7.36) 
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Following Kipping (2008) (see equation (A36)), we have that 



/ 



1 + e p cos f p (t) 



p — 2^/1 — ep 1 tan 



1 + e p 



tan 



e p(! ~ e p) sin /p 
1 + e p cos f p 



(7.37) 



Converting equation (7.37) into a definite integral with integration limits 
fp,0 — e fp,i an d fp,o + e/p,i) taking the Taylor expansion about f p = / Pi o and 
retaining terms of order e gives 



/p,0+e/p,l 



1 



1 + e p cos f p (t) 



df P 



cos 2 (tan 1 



i^tan^ 



cos 



2 /p.o 



2 

epC 1 ~ ep)(cos/ P; o + e p ) 
(1 + e p cos/ P; o) 2 



2e/ p ,i. (7.38) 



Comparing this with equation (7.36), and substituting in the expressions 
for f pfi , f P; i and e gives 



- tra 



2Rs 

n p a p (F(e p ,Up))~ 



(7.39) 



where F(e p ,uj p ) is given by 

(l + e p sina;) cos 2 (tan" 1 



F(e p ,uj p 



(l + e ) l -el 



cos 



2 (7T 



e p (sinLjp + e p ) 



1 — e 2 (l + e p sinwp) 



(7.40) 



Again, we have from equation (7.39) that the transit duration is dictated to 
by two factors. Again the numerator represents the distance travelled by the 
planet while the denominator (along with F(e p ,oj p )) represents the velocity 
of the planet. Consequently eccentricity modifies the transit duration by 
modifying the velocity of the host planet during transit. 
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Figure 7.3: Schematic diagram of the coordinates used to determine the 
shape of the transit light curve. The path of the center of the planet across 
the star's face is indicated with a dashed line. Also, one of the previous 
locations of the planet is shown using a dotted line. 



So, as discussed, the transit duration is determined by two factors, the 
distance travelled across the face of the star (which is modified by the in- 
clination) and the velocity at which it is travelled (which is modified by the 
eccentricity) . To give a feel for these values some example transit durations 



and transit velocities are shown in table 7.1 In addition to determining the 
transit duration, the shape of the transit light curve is determined by the 
orbital and physical properties of the planet and star. 

To determine the shape of a transit light curve produced by a given 



planet as it travels across a given star we consider the method of (Gimenez 



2006). We can write the luminosity of the star, L(t) as 

L(t) = L - a p (S(t)), (7.41) 

where Lq is the luminosity of the star out-of-transit and a p is the amount of 
light occulted by the planet. In addition, we note that a p depends on 6, the 
distance between the centers of the planet and star, but does not depend on 
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Table 7.1: Example transit durations (Tt ra ) and mid-transit planetary ve- 
locities (vtr) for a planet which transits the central chord of its Sun-like star, 
for a range of different values of a p , e p and orbital orientation. Note that the 
letters P and A correspond to a transit occurring at periastron and apastron 
respectively and the symbol corresponds to the case for a circular orbit. 



(f>, the angular position of the planet on the face of the star (see figure 7.3 
for definitions of 5 and (j)). Continuing, we have that a p is defined as 



aJ5(t)) 



I(p)fidA, 



(7.42) 



where S is the region of the star which is occulted by the planet, /i is 
the cosine of 8, the angle between the surface normal and the line-of-sight 



(see figure 7.4), dA is an infinitesimally small area element and I(/J,) is the 



intensity. While both theoretical (e.g. Claret 2000 ) and observational (e.g. 



Sing, 2010) constraints on exist, as will be shown in chapter [8j the size 
and form of the TTV P signal does not strongly depend on the form of /(//). 

Assuming that the velocity of the planet during transit can be considered 
to be constant, 8(t) can be written as 



<*(*) = \/ 5 mm + ((*-*0Mr) 5 



(7.43) 
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Figure 7.4: Diagram showing the relationship between 5 the distance be- 
tween the center of the star's face and the position of interest and the angle 
9, the angle between the surface normal and the line-of-sight. The surface 
of the star is shown using a thick line, the direction along the line-of-sight 
is shown using a dotted line, while lines showing the surface normal are 
dashed. 



Consequently, the transit light curve is given by 



L(t) = L -a (j b 2 min + {{t - t mid )v t rf 



(7.44) 



where b m in is the distance of closest approach between the center of the 
star and the planet, t m id is the time at which this closest approach occurs 
and vtr is the projected velocity of the planet across the star's face during 
transit. While equation (7.44) can be written explicitly in terms of Jacobi 



polynomials (Gimenez 2006), that level of detail will not be required for 
this analysis. 



As can be deduced from equation (7.44), and figure 7.3 the light curve 



L has a number of properties. The first property, symmetry about t m id, 
can be seen by noting that replacing t — t m id by t m id — t does not alter the 
equation. This symmetry is a direct consequence of the intensity /, being a 
function of the angle between the line-of-sight and the surface normal only, 
and that the velocity of the planet during transit remains constant. The 
second property is that the exact shape of the light curve depends on the 
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chord it makes across the star and how fast it travels along the chord, which 
is determined by the planet's orbit, the relative sizes of the planet and star, 
and the mid-time of the transit. 4 As a result, formally L should be written 

as 

L(t) = L(R S , R p , 

where R s and R p are the radius of the star and planet, cip, e^, ojp, I p and 
n p are the semi-major axis, eccentricity, argument of periastron, inclination 
of the planet's orbit around the star and the mean motion, and t m id is the 
time at which the projected distance between the center of the planet and 
that of the star is smallest. 



7.3 Extending the transit technique to find moons 
of transiting planets 

As discussed in chapter |4j there are many ways to extend the transit tech- 
nique to look for moons of transiting planets. While this chapter focusses on 
the photometric transit timing technique, four methods have been proposed 
to extend the the transit technique to search for moons. These methods 
are direct detection, barycentric transit timing, photometric transit timing 
and transit duration variation. Consequently, to provide a context for the 
following investigation, the rationale for each of these four methods will be 
briefly summarised using the transit light curve shown in figure |7.5[ before 
concentrating our investigation on the photometric transit timing technique. 



7.3.1 Direct detection 

The process of direct detection involves searching the region of the light 
curve near the planetary transit for any extra dips due to putative moons. 
For example, for the case of the light curve shown in figure 7.5 , the additional 
dip caused by the moon can be seen translated to the right of the dip caused 
by the planetary transit. 



7.3.2 Barycentric transit timing 

The barycentric transit timing technique (TTV^) involves searching for tran- 
sit timing variations (TTV) where the time of transit is defined by the cen- 
ter of the planetary transit (t m ^ iP ). Departures of consecutive transit times 

4 See appendix G for a proof that the shape of the light curve does not depend on Q, p . 
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Time 



Figure 7.5: Diagram showing the different portions of the transit light curve 
for the case where both a planet and moon transit. Four silhouettes of the 
planet and moon are shown, corresponding to the the beginning and end of 
planetary ingress, and the beginning and end of planetary egress. Planet- 
moon pairs which correspond to a single silhouette are joined by a solid line, 
while the location of the planet-moon barycenter is indicated by cross. As 
the position of the planet-moon barycenter is a linear function of time it can 
be used as a proxy for time. Consequently the position of the barycenter 
and the value of the light curve resulting from that position are linked by 
dashed lines. 



from strict periodicity, a result of motion of the planet around the planet- 
moon barycenter, could indicate the presence of a moon. For example, for 



the case shown in figure 7.5, the mid-time of the planetary transit occurs 



earlier than would be expected due to the presence of the moon. 
7.3.3 Photometric transit timing 

The photometric transit timing (TTV p ) also involves searching for aperi- 
odicity in transit times. However in this case the times used are no longer 
the center of each planetary transit, but the mean time during the transit, 



weighted by the photon deficit (see equation (7.46) for a definition). This 



particular formulation is interesting as it is affected both by the extra dip 
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due to the moon as well as by any lead or lag in the planet transit time 
caused by the presence of the moon. 



7.3.4 Transit duration variation 



Finally transit duration variation (TDV) a technique proposed by Kipping 



Q2009a|bD also uses timing to search for moons of a given planet, but instead 



of measuring the "mid-time" of the transits, the duration of the transit is 
used. Instead of focussing on timing deviations due to the changing posi- 
tion of the planet about the planet-moon barycenter as the barycentric and 
photometric transit timing methods do, this method endeavours to measure 
perturbations to the planet's velocity across the face of the star due to the 



moon. For the case shown in figure 7.5, the motion of the planet and moon 



about their common barycenter during transit result in a longer planetary 
transit duration than would have occurred had there been no moon. 

As this chapter concentrates on the photometric transit timing method, 
it would be useful to expand upon the short description given above. Con- 
sequently, the published results and limitations of this method will be dis- 
cussed in greater detail. 



7.4 The TTV P method 

7.4.1 Introduction 

To provide a context for the work presented in this thesis, we begin by 
summarising the current state of the field with respect to TTV p . In particu- 
lar, this will involve a statement of the definition of the TTV P test statistic 
followed by a summary of all previous work currently presented in the lit- 
erature, with particular emphasis on the results and the gaps. Informed 
by this summary, a more general definition of the TTV P test statistic will 
be proposed. Using this definition, expressions for the timing perturbation 
caused by the moon (named At) and the error on this time (named €j) are 
constructed. We begin with the definition of the TTV P test statistic r. 



7. 4.2 Literature definition of r, the TTV P test statistic 



The photometric transit timing method (TTV p ) was proposed by Szabo 



et al. (2006), and involves the statistic r, the first moment of the dip in 



the light curve, to search for timing perturbations due to moons. Following 
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Figure 7.6: Schematic of the transit light curve. The experimental data 
points are represented by dots and the theoretical light curve is represented 
by a thick line. 



(Szabo et al. 2006), r is defined as 



Ei*(*i) 



(7.46) 



where t, and a(U) are the times and observed absolute photon deficits for 
the i th exposure, and where the sum is carried out only over the region 



marked "transit" in figure 7.6 



7.4.3 Summary and discussion of previously published TTV P 
results 

While some work has been presented on whether or not moons could be 



detected ( Szabo et al. , 2006 ) and which physical properties of these moons 
could be determined (Simon et al. 2007), these analyses are by no means a 



complete description of the capabilities of this technique. In particular, the 
work presented in this chapter extends these analyses in three important 
ways. 



First, the analysis of Szabo et al. (2006) used two unrealistic assump- 
tions. The first assumption was that the ingress and egress times of the 
moon's transit is known, so that the sum could be carried out only over the 
planet and moon transits. The second assumption was that the total unoc- 
cluded luminosity of the star was known for the in-transit period so that the 
difference between this unoccluded luminosity and the measured luminosity, 
could be used to calculate a(tj). Unfortunately, both these quantities will 
not be known for real transiting systems. Consequently, the effect of relax- 
ing these assumptions will be investigated within the context of the physical 
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limits inherent to the system, for example, constraints to the time between 
planet and moon transit resulting from the requirement that the system is 
three-body stable. 

Second, the way that the moon detectability depended on the physical 
parameters of the system is unknown. The Monte Carlo analysis used by 



Szabo et al. (2006) involved producing 500 realisations for a range of ran- 
domly selected planet moon systems including both terrestrial and gas giant 
planet-moon pairs. In addition to determining that realistic moons could 



be detected, Szabo et al. (2006) also used the results of their simulations 
to propose a number of factors which increased moon detectability. These 
were: 

• Shorter exposure time 

• Increased planet semi- major axis 

• Increased moon semi-major axis 

• Decreased relative photometric noise. 

While this approach indicated that, realistic moons could be detected using 
this technique, and identified a number of factors which increased moon 
detectability, they did not give the functional dependence on these factors. 
Consequently, for the case where these factors may be related, for example, 
for the case where the photometric noise is shot noise dominated, decreasing 
the exposure time increases the relative photometric noise, the result of 
altering a variable such as the exposure time is unknown. Consequently, in 
this Part, approximate analytic relationships will be derived which relate 
the detectability of a given moon to physical parameters of the star, host 
planet and moon. 



Finally, the approach adopted by Szabo et al. ( 2006 ) is of limited use to 



observers as it does not provide a simple way to determine the statistical 
significance of a detection or calculate a TTV P detection threshold. To do 



this using the Monte Carlo method of Szabo et al. (2006) would involve 
constructing many realisations of the light curves and determining the per- 
centage of these virtual moons which would have been detected. As this 
would involve creating a set of models spanning the range of possible physi- 
cal (R m and M m ) and orbital parameters (a m , e m , I m , u m , Q m and / m (0)) 
of the putative moon, and requires a large number of realistic realisations 
of the photometric noise, this is not a trivial procedure. Alternatively, in 
this Part, the issue of determining statistical significance, and generating 
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Figure 7.7: Updated schematic of the transit light curve. The experimen- 
tal data points are represented by dots and the theoretical light curve is 
represented by a thick line. A possible "transit" region is also indicated. 



thresholds, is addressed in three main ways. First a method for determin- 
ing the significance level of a detection in terms of measured variables will 
be presented. Second, using this method along with analytic expressions 
describing the timing perturbation and the behaviour timing noise, expres- 
sions which approximately describe the detection threshold will be derived. 
Third, as will be seen, for the cases where a Monte Carlo simulation must 
be run, the insight gained from the derivation of these approximate detec- 
tion thresholds can be used to determine which variables are important and 
which variables are not. By concentrating on these physically important 
variables, the computational load of calculating a threshold is dramatically 
reduced. 

With these three aims it mind, it can be seen that the definition of r 
needs to be investigated in two important ways before analytic expressions 
for moon detectability can be derived. First, the definition of r needs to be 
expanded such that knowledge of of the moon's position during transit and 
the unoccluded intensity of the star is not required. Second, expressions for 
the mean value and error in r for a given transit need to be determined in 
terms of the parameters of the system. 



7.4.4 Generalising the definition of r 



In order to generalise the definition of r, the assumptions made by Szabo 



et al. ( 2006 ) need to be relaxed. In both cases, this process can be informed 



by the physics of the system. 

In order to relax the assumption that the unoccluded luminosity of the 
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star is known, we are confronted with two main issues. First, the unoccluded 
luminosity of the star is not necessarily constant across the length of the 
transit, for example, the luminosity of the star may drop as a starspot 
passes onto its face. Second, the out-of-transit portions of the light curve 
cannot be used to form an estimated unoccluded luminosity as these portions 
may be contaminated by the transit of the moon. Consequently it was 
decided to treat the unoccluded intensity of the star as if it were constant and 
absorb the variability of the stellar intensity into the photometric variability. 
Fortunately, it can be shown that a small amount of error in the exact value 
of the unoccluded luminosity selected has little to no effect on the value or 
variability of any given r value (see appendix |F|) . Thus, the practical effect 
of this error can be ignored. 

The definition of r given by equation (7.46) assumes that the position 
of the moon during each planetary transit, and consequently the location of 
the dip caused by the moon in the light curve, are known before the moon 
is detected. Unfortunately, this information is not known a priori, and thus 
the definition of r must be modified. Consequently, it was decided that 
instead of evaluating the sum over an a priori unknown interval, the sum 
would be evaluated over a region of length T f, s centered on the planetary 
transit. For simplicity, the region selected for this thesis was the smallest 
region always ensured to include the moon's transit (see figure 7.7). In 
particular, this region includes the planetary transit, along with a margin of 
length a m {\+e m ) /vtr either side of the planetary transit, where we note that 
a m (l + e m ) is the distance between the planet and moon at apocenter and 
that vtr is the velocity at which the planet-moon barycenter transits the 
star. While this assumption does require that something must be known 
about the moon before it is detected, it is useful for two reasons. First, it is 
an improvement, in that this work only assumes knowledge of one variable, 
a m (l+e m ), as opposed to three, a m , e m and / m (0)+w m . Second, it results in 
a significant reduction in complexity when determining detection thresholds 
(this issue is further discussed in section 10.3.1). Finally, limits on the size 
of this margin (in particular limits relating to a m ) can be constrained using 
limits from our understanding of moon formation and orbital evolution. 

As discussed in chapter [3j limits can be placed on the properties of moons 
of extra-solar planets as a result of their formation and consequent evolution. 
In particular, limits on their semi-major axis (and consequently the time 
delay between their transit and that of their host planet). These limits come 
in two main varieties. First, an educated guess on where moons are likely 
to be (based on the current understanding of moon formation). Second, a 
more broad description of where moons could possibly exist without being 



131 



rapidly destroyed or lost from the planet. These two cases will be discussed 
in turn. 

We begin by considering the places where moons are likely to be found. 
As formation processes and evolution timescales differ for terrestrial and gas 
giant planets, these cases will be discussed separately. 

For the case of terrestrial planets, impact generated moons form very 
close to their host planet, and then, their orbits evolve outward. Conse- 
quently, the mass of these moons is determined by the impact process and 
their final semi-major axis depends on the orbital evolution of the moon, 
with the orbit of more massive moons evolving more quickly than that of less 



massive moons (see section 3.4.1). Consequently, given an upper mass limit 
(0.04M p ), a model of the evolution process, an estimate of the Love number 
(&2p), the Q- value (Q p ) and the age of the system, the semi-major axis of 



such a moon should be able to be predicted (e.g. using equation (3.16)). 
Assuming that the physics used to calculate the orbital evolution is correct, 
this semi-major axis can be treated as an outer limit as other processes, such 
as tidal locking can halt orbital evolution. For the case of an Earth-like 5 host 



planet at 0.2AU and 1AU, the largest region allowed by the equation (3.16) 
constraint will begin and end 3.4 hours and 7.7 hours, before and after the 
planetary transit respectively. 

For the case of gas giant planets, large moons generally form in an ex- 
tended region which is relatively close to their host planet. Unfortunately, 
as the way in which the size of this region in which regular satellites form 
scales with planetary parameters is not fully understood, 6 it is difficult to 
use it to place firm limits on the location of regular satellites. However, using 
Solar System data we can suggest qualitative limits on where large moons 



of gas giant planets are likely to be found. From table 7.2 we have that all 
the large regular satellites of gas giants lie within three solar radii of their 
host. Assuming that the host-star is Sun-like, for the case of a host planet 
at 0.2AU and 1AU, the largest region allowed by this constraint will begin 
and end 8.7 hours and 19.5 hours, before and after the planetary transit 
respectively. 

In addition to the window in which moons are likely to be found, there is 
also the window in which moons can possibly exist, that is, not be instantly 
destroyed or lost by tidal disruption, impact with the host planet or, by 
three body instability. In particular, moons are destroyed if their orbits are 



b k 2p = 0.299, Q v = 12, T = 5Gyr, M v = M m and R p = 7? . 

3 For example, Mosqueira & Estrada (2003a b) suggest that the size of this region scales 



with Rh while Canup & Ward ( 2006 1 suggest that it scales with R p 
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too close to their host planet, and are three-body unstable if their orbits 
are too distant. As we would like an upper limit to the size of the window 
that we need to search, we will consider three-body instability only. From 



equation (3.13) we have that, for a prograde moon to be orbitally stable, 
the semi-major axis of the moon must be less than approximately 0.5 Hill 
Radi, where Rh, the Hill radius is defined as 



Rh = a p 1 



M^ 1 ' 

3M S 



This corresponds to a constraint that the moon can lead or lag the planetary 
transit by a factor of 0.5Rh /2ira p times the orbital period of the planet. 
For example, using this approximation, for the case of a Earth-like planet 
orbiting a Sun-like star at 0.2AU and 1AU, this requirement means that the 
transit of a moon can only lead or lag the planetary transit at most by 0.6 
hours and 7.0 hours respectively. Similarly, for the case of a Jupiter- like 
planet orbiting a Sun-like star at 0.2AU and 1AU, this requirement means 
that the transit of a moon can only lead or lag the planetary transit at most 
by 3.4 hours and 48.3 hours respectively. 



7.4.5 Analytic groundwork 

Now that r has been redefined, the mechanics required for searching for a 
signal must be constructed. In order to search the sequence of r values, 
we need to be able to write r as a function of transit number, moon and 
planet orbital parameters as well as the influence of any photometric noise. 
Assuming that the planet has only one moon, 7 Tj, the r value calculated 
from the j th transit where the numbering begins at zero, can be written as 

Tj = t + jT p 

~i~ At yj, (l m , 6 m , f m (to) , I-mi Ip,&p, bJpi Q"rn ^p, 1p, T m ) -|- €j , (7.47) 

where At is a function representing the effect of the presence of the moon 
on Tj, Ej represents the timing error due to photometric noise, to would have 
been the mid-time of the zeroth transit for the case where the planet did not 
have a moon, and where T p and T m are the orbital periods of the planet and 
moon orbits respectively. In addition, At is a function of a m , e m , f m (to), I m 
and oj m , which are the semi-major axis, eccentricity, true anomaly at time 

7 For the case where additional moons are suspected, equation ( |7.47[ ) can be modified 
by including additional At terms. 
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to 

and uj 



the inclination and argument of pericenter of the moon's orbit, I p , e p 



Pi 



which are the inclination, eccentricity and argument of periastron 
of the planet's orbit, and O m — £l p , the difference between the longitudes of 
the ascending node for the planet and moon's orbit. 8 We begin by writing 



a{U) = a p (U) + a m (U) + a n (U 



(7.48) 



where a p is the absolute dip due to the transit of the planet, a m is the 
absolute dip due to the transit of the moon 9 and a n is the noise on the light 
curve due to photometric variability. Note that a n is defined such that it 
has a mean of zero and consequently it can have positive or negative values. 
In particular, using equation (7.42), we can write a p and 



a p (S p (t)) = / I(n P )n P dA, 



(<5m(*)) = / I(fJ-m)HmdA, 

>s„ 



(7.49) 
(7.50) 



where S p and S m represent the region of stellar surface occulted by the 
planet and moon respectively and fi p and [i m are the \i values, again corre- 
sponding to the position of the planet and moon respectively. Also, as we 
will investigate the effects of different types of photometric noise on r, we 
will keep the definition of a n as general as possible. 

Substituting equation ( |7.48| ) into equation ( |7.46 1 gives 



Yjj ti(a P (U) + a m (ti) + a n (tj)) 
J2i a p(ti) + ot m {ti) + a n (ti) 



(7.51) 



As a n is present in both the numerator and the denominator, the numerator 
and denominator are correlated. 

To see how this correlation manifests practically, we consider an example. 
First, for clarity, equation (7.51 ) is reformatted by writing t j = t i — t m id iP + 
t m id,p, where t m id, p is the mid-time of the window, and simplifying, giving 



T — tmid,p 



Y.jjtj - tmid,p){a£ P (ti) + a m (tj) + a n (ti)) 
Ei a P (ti) + a m (ti) + a n (U) 



(7.52) 



See section 



,2.3 



9 For the case where the moon is eclipsed by or eclipses its host planet, the moon will 
not cause an additional dip, i.e. a m = 0. However, for both these cases At w 0, that is, 
the true value of At is approximately equal to the value of At if there were no moon. 
Consequently the situation where a m — 0, when the moon is in front of or behind the 
planet, will be neglected in this thesis. 
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Now, consider the case where one value of a n , lets say a n (tk), is large (and 
positive) and where tk > imid,p (i.e a data point near the egress of the tran- 
sit). As (tk — t m id, p ) is positive, a n (tk) acts to make both the denominator 
and numerator of the fraction more positive. These two effects partially 
cancel, thus, the error in r resulting from the effect of a n (tk) will be small. 
Conversely, for the case where a n (tk) is still large and positive, but where 
tk < t m id,p (i.e a data point near the ingress of the transit) this is no longer 
the case. As (tk — t m id lP ) is now negative a n (tjS) now acts to make the nu- 
merator smaller but the denominator larger. Consequently, for this case the 
error in r resulting from the effect of a n (tk) will be amplified. For the case 
where a n (tk) is large and negative, these effects are reversed. As a result of 
these behaviours, we cannot investigate the denominator and numerator of 



equation (7.51) in isolation, and then combine the results. 



Consequently, to make analytic progress, equation (7.51) must be refor- 
matted to remove this correlation. Assuming that £\ a n (ti) <C ^2i(a P (U) + 
<^m(U)), the binomial expansion can be used to expand the denominator of 



equation (7.51), giving 



Yi ti(a p (ti) + a m (U) + a n (ti)) 



Yi a n(U 



52iOt p (ti) + a m (ti 



(7.53) 



Expanding equation (7.53), neglecting terms of order E a n/EK + a m)) 2 
and gathering terms linear in a n under the same sum sign gives 



Yjj ti(u P (tj) + a m (tj)) 

Yi a p(U) + OL m (ti) 

1 



+ 



Yli <Xp(U) + a m (ti 



E 



Y^j U(a p (ti) + a m (tj)) 

YiUpiti) + a m(U) 



a n (U). (7.54) 



Note that as this first term does not contain a n , it is exact and consequently 
only contributes to jT p + to + At. Expanding and then contracting the first 



term of equation (7.54) gives 



+ 



Yi a m(U) Y,i t i a m(U 



Yi GLp(ti) + a m (ti) Si a p(*i) Yi a pi t i) + a m(U) Yi a m(t 



+ 



^2i a p(U) + a m(U 



E 



Yj U(a P (tj) + a m (ti)) 
X)i«p(*i) + a m (ti) 



a n (ti), (7.55) 
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or 



ApTp -\- A n 
A„ + A Ti 



+ 



1 



A m 



a. 



(7-56) 



/J- 



Sja m , the area of the dips caused by the 



where A p 

planet and moon respectively and where t p and r m are defined by t p = 

(^2i t iOi p (ti))/(Y^i a p( t i)) and T m = {Y.i^miti)) / {Y,i a m{ti))- A m and A p 
can be written explicitly in terms of the position of the planet and the moon 
respectively using equations (7.49) and (7.50). Noting that J adt « At Yl a i 
we have that 



A,, 



A,. 



1 

At 
1 

At 



/apdt = I([ip)fipdAdt, (7-57) 



,dt 



I{Hm)HmdAdt. 



(7.58) 



where At is the exposure time and the integrals are conducted over the 
same time period as the sum (see figure 7.7). Similar expressions could be 
presented for t p and r m . However, these will not be required as we will be 
assuming that the planet and moon move with a constant velocity during 
transit. For this case the transit light curves become symmetric and t p and 
T m are given by the mid-times of their respective transits. 

Returning to the derivation of expressions for At and €j , and examining 



the second term of equation (7.54), it can be seen that it consists of a 



sum of random variables with zero mean. Consequently, the second term of 



equation ( 7.54 ) can only contribute to ej. As the first term of equation 
contributes only to jT p + to + At and the second term of equation 
only contributes to e,-, we have that 



7.54) 



7.54) 



jTp + t + At 



ApTp -\- AmTri 



Ap + A n 



Ap + A r 



- (jT p + t + AT)]a n (ti 



(7.59) 
(7.60) 



where the definitions of A p , A m and equation (7.59) have been used to sim- 



plify the coefficient of a n . Thus, the process of determining if a given moon 
is detectable is the process of comparing equation (7.59), the 'signal', and 



equation (7.60), the 'noise', for the set of available transits. Consequently, 



these two equations will each be investigated in turn in chapters [8] and [9] 
and combined to produce detection thresholds in chapter [10| 
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7.5 Conclusion 



A general introduction to transiting planets and the detection of moons 
of transiting planets has been presented, with the aim of introducing the 
material required for the work conducted in this Part. This was done in three 
broad stages. First the transit technique was discussed, and mathematically 
investigated, with particular emphasis on deriving expressions for the transit 
duration, and the shape of the transit light curve. Then the literature 
relating to the detection of moons of transiting planets was summarised. 
Finally, the TTV P technique was focussed on in terms of describing the 
work previously presented in the literature, discussing how my work fits 
within that context and using the definition of r to derive equations for the 
the timing perturbation due to the moon At and the timing noise €j. Now 
that the background work has been summarised and the equations defining 
the problem have been introduced, the form of At, the TTV P signal, can 
be investigated. 
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Chapter 8 



TTVp Signal caused by an 
Extra-solar Moon 



8.1 Introduction 

The first step in determining the attributes of detectable moons using the 
photometric transit timing method is to investigate the timing signal, At, 



defined in section 7.4 As observing time is limited, it would be of use to 
know the way in which the detectability of a given moon depends on the 
properties of its host planet, and thus be able to focus follow-up on the 
set of planets most likely to have detectable moons. Consequently, in this 
chapter, this signal is investigated in terms of the physical parameters of 
the system, that is, the masses and radii of the host star, planet and moon 
and the orbital elements of both the planet and moon orbits, 1 but with 
particular emphasis on the physical and orbital properties of the planet. As 
a result of processes such as motion of the planet and moon about their 
common barycenter during transit, analytically deriving the general form of 
At in terms of these elements is not a trivial problem. Consequently, three 
representative cases, which highlight the types of planet orbits likely to be 
encountered, were selected. These are: 

1. A circular planet orbit aligned to the line-of-sight, 

2. A circular planet orbit slightly inclined to the line-of-sight and 

3. An eccentric planet orbit aligned to the line-of-sight. 



1 Recall that "planet orbit" refers to the orbit of the planet-moon barycenter about the 
star and the "moon orbit" refers to the orbit of the moon about the planet. 
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In addition, for simplicity, it is assumed that the moon's orbit is circular 2 
and coplanar with the planet's orbit. These cases will be investigated in 
turn, in terms of the form and associated properties of At resulting from 
these configurations. However, in order to perform this investigation, an 
appropriate coordinate system and method needs to be selected. 



8.2 Definition of the coordinate system 

In order to calculate At, the positions of both the planet and moon on 
the face of the star need to be known as a function of time. Consequently, 
to provide a framework for this description, a coordinate system must be 
selected. It would be optimal if this coordinate system could be used to 
simply describe the systems most likely to form, easily provide pertinent 
information, such as, the position of the planet and moon on the face of 
the star, while also relating to coordinate systems used in the literature. 
To select a coordinate system with these properties, two issues must be ad- 
dressed. First, the orientation of the coordinate system needs to be decided, 
with respect to "natural" standards such as the plane of the sky and the 
line-of-sight. Second, using this coordinate system, two reference directions, 
required for the definition of the Euler angles (which define the orientation 
of the planet and moon orbits), need to be selected. For example, two of 
the coordinate axes could be selected for use as the reference directions. As 
the orientation of the coordinate axes informs the choice of the reference 
directions, the choice of coordinate axes will be discussed first. 



8.2.1 Orientation of coordinate system 

The selection of the orientation of the Cartesian coordinate system to be used 
in this analysis should be informed by both the physics and mathematics of 
the system. However, there are two main choices for the orientation of the 
three axes. 

First, one of the axes could be chosen such that it is parallel to the 
projection of the line-of-sight onto the plane of the planet's orbit (see fig- 
ure 8.1(a)). This coordinate system is physically motivated in that it has 
the advantage that it can easily describe the case where the planet's and 
moon's orbits are coplanar. As a majority of formation mechanism relevant 
to large moons, have a preference for producing moons in circular copla- 



2 The effect of a small amount of eccentricity in the moon orbit will be investigated in 
appendix [I] 
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S P M S P M 

(a) Moon orbit coplanar with planet orbit, (b) Moon orbit aligned to the line-of-sight. 



Figure 8.1: Schematic diagram of the two proposed coordinate systems over- 
laid by the planet-moon systems they are optimised to describe (see text). 
For each diagram, the star (S), planet (P), moon (M) and observer (rep- 
resented by an eye) are arranged from left to right. In addition, in both 
diagrams, a dashed line and a dotted line is used to show the alignment of 
the moon's orbital plane with respect to the planet's orbital plane and the 
observer respectively. 



nar orbits (see section 3.3), a simple description of these cases is clearly an 
advantage. Unfortunately, as this coordinate system is tied to the planet's 
orbit, it is non-inertial, that is, if processes such as orbital precession act 
on the planet's orbit, they will also act to change the orientation of the 
coordinate system. 

Second, the coordinate system could be defined such that one axis of the 
coordinate system is oriented along the line-of-sight (see figure 8.1(b) ). This 
choice of coordinate system allows easy description of the most mathemati- 
cally simple configuration, the case where the planet and moon traverse the 
same chord of the star when they transit. In order for this to occur, the 
moon's orbit would have to be slightly inclined with respect to the planet's 
orbit. While this configuration is the most simple, there is no good physical 
reason why the moon orbital plane would have exactly the right inclination 
and orientation such that it would be specifically aligned with our line-of- 
sight. Despite this, this coordinate system has the added advantages of 
being inertial and that its two remaining axes lie on the plane of the sky. 

Fortunately, to the accuracy required for this work, these two coordinate 
systems are essentially equivalent, in that the error incurred by approximat- 
ing the planet-moon system shown in figure 8.1(a) with the one shown in 
figure 8.1(b) is not measurable. This is demonstrated below. 

To see this equivalence, consider a planet-moon pair with circular copla- 
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Figure 8.2: An enlarged version of figure 8.1(a) showing the orbital elements 
Ip, a, p and a m . The star, planet and moon are assumed to be collinear, as 
this is the configuration which leads to the maximum vertical displacement 
between the planet and the moon on the face of the star. 



nar orbits. 3 As can be seen from figure 8.2, AS, the maximum distance 
between the planet and moon chord across the face of the star is 

A5 = a m cosI p . (8-1) 

Using the most extreme value of I p such that the planet still transits, that 
is, 

R 

cosi„=— , (8.2) 
a p 

we have that 

A5 = R S — . (8.3) 
a p 

Taking a m to be that of the most distant stable moon orbit, and taking this 
to be 0.5 Hill radii, gives 

/ M \ 1 / 3 

A4 = Bxa5 Ut) ■ < 8 - 4 » 

Consequently, the maximum deviation possible for the cases where M p /M s 
is equal to 10" 2 , 10" 3 and 10" 4 , is equal to 7.5%, 3.5% and 1.6% of a stellar 
radius respectively. As moons are likely to form close to their host planet 



3 The increase in A<5 corresponding to a small rotation, is maximised for orbits initially 
aligned to the line-of-sight. 
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and not near the boundary for orbital stability (see section 3.3), the devia- 
tions observed for real moons are likely to be much smaller. Consequently 
a moon which is on a circular coplanar orbit can be treated as if its orbit is 
aligned to the line-of-sight and visa versa. Thus, formally, we can select one 
of these coordinate systems while still retaining the benefits of the other. To 
allow ease of mathematical description, it was decided to select a Cartesian 
coordinate system with one axis pointed along the line-of-sight and the two 
remaining axes in the plane of the sky. In addition, in order to take advan- 
tage of the mathematical intuition associated with the Cartesian coordinate 
system, it was decided to define the x and y-axes to be in the plane of the 
sky, aligned such that the y-axis points north, such that the position of the 
planet and moon on the face of the star are given by their respective x and 
y-coordinates, and the z-axis to lie along the line-of-sight. 



8.2.2 Selection of reference directions 

In order to describe the orientation of the moon's and planet's orbits, a 
reference direction, required for the definition of the inclinations, I p and I m , 
and a second reference direction required for the definition of u p and uj m , the 
periapse arguments, and O p and f2 m , the longitudes of the ascending node, 
needs to be selected. With this in mind, the selection of the two reference 
directions will be discussed in turn. 

The selection of the reference direction for the definition of inclinations 
was informed by the conventions present in the literature. In the transiting 
planet literature, the line-of-sight is generally used as the reference direction, 
for example, planetary orbital inclinations are given as the angle between the 
planetary orbit normal, and the line-of-sight. Consequently, for this work, 
the line-of-sight was used as the reference direction for both the planet and 
moon orbits. 

For this work, the rr-axis is selected as the reference direction for lon- 
gitudes (recall that the y-axis is defined to be north). While this selection 



was informed by the coordinate systems used in the literature (e.g. Murray 



& Dermott, 1999, p. 48), the choice is slightly arbitrary as the coordinate 
system will be rotated about the z axis by — Q p in section 8.2.3 for mathe- 



matical convenience. 

These two choices and the consequent definitions of I p , I m , u p , u m , Q p 



and Q, m are summarised in figure 8.3 Now that the coordinate system has 



been defined, it can be used to describe the path of the planet and moon 
across the face of the star during transit. 
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Figure 8.3: Schematic diagram of the coordinate system. The diagram on 
the left shows the position of the planet (large circle) and moon (small circle) 
on the face of the star during transit. The dashed line indicates the path 
of the planet-moon barycenter across the face of the star. The diagram on 
the right shows how the orientation of a general orbit is related to its three 
Euler angles (I, and oj), noting that n is a unit vector normal to the 
plane of the orbit. This is the system used to describe the orientation of the 
planet's orbit in terms of the Euler angles I p , Q p and u p , and the orientation 
of moons orbit in terms of the Euler angles I m , £l m and u m . 



8.2.3 Position of the planet and moon on the face of the star 

Now that a coordinate system has been selected, the positions of the planet 
and moon on the face of the star can be specified as a function of time. 



Using the coordinate system shown in figure 8.3, rotating the planet and 



moon orbits by their associated Euler angles gives 

x p = r p cos £l p cos(/ p + uj p ) — r p sin tt p cos I p sin(/ p + u) p ) 
M m 

- Y7 r m [COS Q, m COs(/ m + Wm) sin d m cos I m sin(/ m + u m )\ , (8.5) 



, = r p cos Q p cos(/ p + Up) — r v sin Q, p cos I p sin(/ p + u p ) 
+ Y7^ r m [ cos cos(/ m + 

^m) sin r2 m cos I m sin(/ m + u m )] , (8.6) 
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Up = Tp sin Jlp cos(/ p + ojp) + r p cos Q p cos I p sin(/ p + 



0J n 



M m 

— — r m [sin Q m cos(/ m + u m ) + cos fi m cos I m sin(/ m + w m )] , (8.7) 



2/m = sin f2 p cos(/ p + u> p ) + r p cos f2 p cos I p sin(/ p + cj p ) 

+ T7^- r m [sin O m cos(/ m + Lo m ) + cos fi m cos I m sin(/ m + w m )] , (8.8) 

where M pm = M p + M m . In addition, for these equations r p and r m are 
given by 

a„(l — el) 

- - PK pJ (8.9) 



1 + e p cos / p 

Qm(l ~ 6 m ) 

1 + e m cos / m ' 



(8.10) 



and e p and e m are the eccentricities of the planet's and moon's orbit respec- 
tively. The true anomalies of the planet and moon orbits, f p and f m , are 
related to time through 



cos E p — e p 



cos/ p = - (8.11) 

1 — e„ COS tin 



cos E m — e m 

cos/ m = — , (8.12) 

1 — e m cos E„ 



and 



n p (t — to) = E p — e p sin E p , (8.13) 

n m (t - t ) = E - e m sin E m , (8.14) 

where n p and n m are the mean motions of the planet's and moon's orbit 
respectively. 



Equations (8.5) to (8.8) can be simplified further. These equations de- 
scribe a transit where the path of the planet-moon barycenter across the star 
makes an angle of Q p with the x-axis. For purely mathematical reasons, it 
would be useful if this path was parallel to either the x or y-axes. As the 
shape of the transit light curve does not depend on f2 p (see appendix [G]) , 
the coordinate system can by rotated about the z-axis by — £l p , effectively 
setting £l p to zero and making the path of the planet-moon barycenter across 
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I 

Figure 8.4: Schematic diagram of the positions of the planet and moon 
during transit, after rotation of the coordinate system by —Q p . Note that 
the x-axis is now parallel to the chord made by the planet-moon barycenter 
and the positive y-axis now bisects this chord. 



the face of the star horizontal. Performing this rotation (see figure 8.4), and 
simplifying gives 



M m 

x p = r p cos(/ p + Up) - — — r m [cos(O m - Q p ) cos(/ m + u m ) 



M, 



pin 



+ sin(f2 m - Q p ) cos I m sin(/ m + u m )] , (8.15) 



M, 



x m = r p cos(/ p + u p ) + — — r m [cos(O m - Q p ) cos(/ m + u r , 



prn 



+ sin(f2 m - Q p ) cos I m sin(/ m +%)] , (8.16) 



y p = r p cos Q p cos I p sin(/p + u p ) 



Mr, 



[s\u{Q m - Q p ) cos(/ m + u v 



prn 



+ cos(fi m - Qp) cos I m sin(/ m + u m )] , (8.17) 



Dm = r p cos Q p cos I p sin(/ p + w p ) + — — r m [sin(Q m - O p ) cos(/ m + 



prn 



+ cos(r2 m - Qp) cos I m sin(/ m + w m )] . (8.18) 
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Now that equations describing the position of the planet and moon on 
the face of the star have been derived, we can begin to investigate what 
method would be most useful to determine At using these equations in 
terms of the three special cases under consideration. 



8.3 Discussion of method 



8.3.1 Introduction 



Now that we have expressions for the location of the planet and moon on 
the face of the star as a function of their orbital elements and time (equa- 



tions (8.15) to (8.18)), we can combine these with expressions for the amount 



of light blocked by these objects, a p and function of their position 



mi -Ap 



(equations (7.49) and (7.50)) and consequently calculate At via t p , t, 
and A m using equation ( 7.59[ ). 

Unfortunately, the case where a p and a m are dominated by non-uniform 
motion across a limb darkened star is not mathematically simple. For ex- 
ample, consider the calculation of A p via a p . As can be seen from equa- 



tion (7.49), a p is defined in terms of a surface integral with domain S> 



where the domain represents the region of the face of the star physically 
occluded by the planet. Consequently, the shape of this domain S p depends 
on whether the planet is on the face of the star (S p is circular) whether the 
planet is in ingress or egress (S p is lens-shaped) or whether the planet is 
off the face of the star (S p is non-existent). In order to perform the time 



integral required to evaluate equation (7.49), the equation which defines A 



in terms of a p , the shape and location of S p , a function of planet position, 



needs to be determined as a function of time. As equations (8.15) to (8.18), 
the equations which relate planetary position to time, are transcendental 
equations in time, this is not a trivial problem. So, while expressions for 
A p and similarly A m , can be derived, they are so complex that they do not 
give much physical insight into this system. Consequently, we need to use 
physically appropriate approximations to simplify the equations, especially 



equations (7.49) and (7.50), in order to highlight the underlying physics. 



As a result, it was assumed that the planet and moon move across the 
face of the star with constant, but not necessarily equal, velocities. This 
approximation was selected as first, the characteristic timescale over which 
the orbital velocity of the moon changes, e.g. its orbital period, is generally 



much larger than a transit duration, (see tables 7.1 and 7.2) and second, it 



results in substantial mathematical simplification. Recall from section 7.2 



that for the case of uniform motion, transit light curves become symmetric 
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about their midpoint. This simplification results in two important proper- 



ties, which will be stated here, but derived in section 8.3.2 First, t p and r TO 
become the time-coordinates of the lines of symmetry of a p (t) and a m (t) 
respectively. Second, the equations for A p and A m reduce to the product of 
the transit duration with a geometric term, which depends on the transit 
geometry, planetary and stellar radii and stellar limb darkening parameters. 
To take advantage of these properties, a practical method for implementing 
this assumption needs to be investigated. 

8.3.2 Implementation of method 

The simplest way of implementing this approximation is to determine ti np , 
teg,p, tin,m and t eg>m , the time of ingress and egress of the planet and moon 
respectively, and use them to calculate r p , r m , A p and A m and thus At. 

8.3.2.1 Evaluating r p and r m in terms of t intP , t e9jP , t irhm and t e9tm 

For the case of r p and r m , writing these quantities in terms of U njP , t eg , P , 
tin,m and t e g t m is simple as a p (t) and a m (t) are symmetric, as a result of 



the physics and geometry of the system (see section 7.2). Thus, t p and r m 
should correspond to the geometric mean of the time of egress and ingress 
of the planet and moon respectively. Consequently 



$.19) 



2 ' 

teg,m + Hn^m 



8.3.2.2 Evaluating A p and A m in terms of ti UtP , t eg , p , ti n , m and t e 
For the case of A p and A m we have that 



A v = Xt ap{t)dt (8 - 21) 



A m = ^~ t I a m (t)dt (8.22) 

As can be seen, finding a simple way of writing A p and A m in terms of of U niP , 



teg,p, Un,m and t eg>m , will take a little more work. Consider equation (8.21). 
A p is defined as the integral of a complicated function of t, with the limits 
on the integral being given by a function of the variables of interest, tin tP 
and t egtP . . .not an optimal format! To separate the dependance of A p on 
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the times ti np and t eqp , from its dependance on the brightness profile of the 



star, equation (8.21) was rewritten using 8 P , the projected distance between 



the planet and the center of the star, given by 

S P (t) = \l5 2 min + {{t-r p )v tr , p )\ (8.23) 

where 



Vtr,p = ~i • (8.24) 



where S m i n is the smallest projected distance between the center of the 
planet and the star during transit and where S, which for this application is 



S p , and Smin are both shown in figure 7.3 



Dividing the integral into two halves to account for the different be- 
haviour of S p for the cases t < t p and t > t p , using equation (8.23) to 
substitute for t, and using S p as the integration variable, equation (8.21) 
becomes 

^ =At J Rs+Rp ^dsf* + At J Smin a ^ ] dS- p d ^ (8 - 25) 



1 f Smin a p (S p ) x -Sp dS 



At 



Rs+R P v tr+p JSl - S 



p 



mill 



At 



'•ir+pJS^-S^ 



'.iP-fc (8-27) 



v tr+p AtJ Smin J ft- p . 

y P mm 

Now, consider the case where the planet has no moon. In this case the 



transit velocity will be given by vt r , so, from equation (8.27) we have that 



y p mm 

where the hat has been added to show that this is a comparison case. As- 
suming that S m i n is the same for both cases (true for the three special cases 
investigated in this chapter), we can write A p in terms of A p using equa- 
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tion (8.28 



■ ), and thus equation (8.27) can be written as 



A 



p — -ft-pi 
Vtr+p 



teg,p __ v 2 



2jm 



5 2 ■ 

mm 



.29) 
.30) 



Note that the necessity of performing the surface integral in equation (7.49) 
is now entirely avoided. Similarly, we have that 



teg,m tin,m 

2\ R% — <£L.-_ 



VtrA. 



where 



A - 1 1 

A m — 2 — 

Vtr At J/: . 

u min 



Rs+R 

n 



51 - SL. 



-dS m . 



5.31) 



5.32) 



To demonstrate that the expressions derived for r p , r TO , A p and A m are 
valid for the systems of interest, the quantities given by equations (8.19), 



5.20), (8.30) and (8.31) were compared to the full expressions using a sim- 
ulation. 



8.3.3 Validation of approximation of uniform velocities 



To investigate the validity of the assumption that the planet and moon 
move with uniform velocity during transit, a set of simulated light curves 
were constructed with the aim of comparing the values of T m and A m calcu- 
lated directly from the light curve, with the approximate values of r m and 
A m calculated using equations (8.20) and (8.31). This simulation focussed 



specifically on the dip caused by the moon for a range of reasons which will 
be described in the next paragraph. The set of conditions under which the 
assumption of uniform velocity during transit breaks down was investigated 
by considering an approximate expression for the maximum change in the 
velocity of a moon during transit. In particular it was found that high values 
of v m /vtr or R s /a m could result in non-uniform moon (or planet) motion. 
Consequently four simulations were run looking at the transit of the moon 
for a range of different values of v m /vt r and R s /a m . These issues, and the 
results of the simulations will be discussed in turn. 

The dip due to the moon was chosen as the target for these simulations 



for three main reasons. First, as was shown by (Szabo et al. 2006) and as 



will be shown in section 8.4.1 the value of At is dominated by the effect of 
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i/Vtr = 0.33. 



Figure 8.5: Diagram showing the shape of the transit light curve of the 

moon (black line) as a function of orbital phase / m (0) for four different 

combinations of moon orbital radius and velocity ratio. The time axis is 

defined such that the planet-moon barycenter takes two time units to cross 

the face of the star, consequently on average the dip caused by the moon 

should be two time units long. In addition, the depth of the dip is scaled 

such that the area of dip caused by a moon moving at the same velocity as 

the planet-moon barycenter would be equal to one. The true and predicted 

values of r m are shown as blue and red bars on the transit light curves, while 

the true and predicted areas of the transit light curves are displayed to the 

right of each light curve, again in blue and red. 
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the moon. Second, the orbital motion of the moon around the planet-moon 
barycenter is much more pronounced than the orbital motion of the planet 
around the planet-moon barycenter, and consequently any asymmetry in the 
transit light curve caused by this motion will also be more pronounced for 
the light curve corresponding to the moon as opposed to that corresponding 
to the planet. Third, for the case where the moon is small with respect to 
the star {R m <C R s ) and the planet (M m <C M p ), the orbit of the moon 
becomes independent of it's mass 4 and the shape of the light curve becomes 
independent of the radius while its depth becomes proportional to the cross- 
sectional area of the moon. 5 In addition, for simplicity, it was decided to 
simulate the case of circular coplanar orbits. Also, it was assumed that the 
planet and moon transited the central chord of the star as these systems 
suffer from the most extreme transit distortions, for example, due to the 
associated longer transit durations. The light curve caused by a moon was 



modeled by using equations (8.16) and (8.18) to determine the position 
of the moon as a function of time, and equation (7.50) to determine the 
corresponding value of a m . 

In order to focus our investigation on the region of parameter space 
where the assumption of uniform velocities breaks down, and consequently 
select representative scenarios for the numerical simulations, we begin by 
estimating the degree to which velocity of the moon across the plane of the 
sky changes during transit. In particular, the maximum change in velocity 
which can occur during transit is equal to 

max(Aw) « T tra x max(t) m ), (8.33) 

where Tt ra is the transit duration, and max(u m ) is the maximum acceleration 
along the plane of the sky. For an moon on a circular orbit, the maximum 
acceleration, and thus the maximum possible acceleration along the plane 
of the sky is given by 

v 2 

max(%) < — . (8.34) 



4 Recall that the orbital mean motion of the moon is given by (G(M P + MmJ/ai) 1 ' 2 . 
Thus for M m < M p , n m w (GM p /a m ) 1/2 ■ 

To see this, consider a generic transit light curve consisting of ingress, eclipse and 
egress. The duration of ingress and egress is proportional to the radius of the moon (see 



appendix [Ep , while the duration of the transit is of the order of R a (see section 7.2|. 
Consequently, in the limit that the moon is small, the transit light curve is dominated by 
the eclipse portion and the ingress and egress can be neglected. In addition, as the moon 
becomes smaller, the region of star that it is blocking becomes more homogeneous. As a 
result, in this regime changing the size of the moon does not affect the shape of the light 
curve, only the relative depth. 
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In addition, the maximum value of Tt ra can be approximated using equa- 



tion (7.18). Consequently, 



/ a \ , 2i? s V m 
max(Af J < v r , 

0"m 



(8.35) 



Noting that v m is proportional to (R s / a m ) l l 2 we have that for a given star 
(constant R s ) 



max(Au) oc 



Vtr' 



(8.36) 



Consequently, the change in velocity for a given star depends on two ratios, 
v m/ v tr and R s /a m . As a result of this dependance, moon light curves were 
simulated for a range of values of v m /vt r and R s /a m . As the analysis in 
this chapter is limited to v m /vt r < 0.66 as a result of the expansion that 
will be used to derive ingress and egress times, it was decided to investigate 
moon light curves using two different values of v m /vt r - First, the case of 
Vm/vtr = 0.66 was investigated as it corresponds to the worst case scenario 
that can be described by this analysis. Second, the case of v m /vt r = 0.33 
was investigated as a comparison case. The selection of appropriate values 
of a m was informed by the regular satellites in the Solar System. As these 
satellites are found between 0.27 (Ariel) and 2.7 (Callisto) solar radii from 

it was decided to investigate the cases where o m 
and 2R S . These simulations were performed, and the 



their hosts (see table 7.2 
is equal to 0.5i? s , R s 
resulting light curves are shown in figure 8.5 



While distortion of the light curves is evident (see figure 8.5), the posi- 
tion of the true first moment, compared to the value calculated from equa- 



tion (8.20), agree well for the majority of the orbital period of the moon 
for v m /vtr up to 0.66 and a m /R s > 1. For moons with a m /R s < 1, there 
is some disagreement. However, for a m /R s < 1 the signal caused by the 
moon is small and so it is likely that moons for which the assumption of 
uniform motion does not apply, will also not be detectable. In addition, as 
can be seen from figure [83J for the case of a m = 0.5i? s , the magnitude of the 
predicted value of r m is always less than the magnitude of the true value. 
Consequently, using these assumptions, we will still be able to place limits, 
albeit generous, on the population of these inner moons. Finally, values of 
A m were calculated using equation (8.31) and all agree well with the values 
predicted for A m given by the simulation. Consequently the effect of the 
non-uniform motion of the moon on the value of r can be safely neglected, 
and equations 



.19), (8.20), (8.30) and (8.31) used. 
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Moon Orbit 




Figure 8.6: Schematic diagram of the same form as figure 8.4 of the coor- 
dinate system for the case of circular coplanar orbits. In particular, it is 



assumed that I p = tt/2, I m = tt/2 and Q Ti 



n 



V 



8.4 Circular planet orbit aligned to the line-of- 
sight 

Now that the preliminary work in describing a coordinate system and de- 
termining a method for deriving Ar is complete, we can concentrate on 
determining the effect of the physical parameters of a given planet-moon 
system on At. We begin with the first, and simplest, of our three special 
cases, the case where both orbits are circular and coplanar, and both the 
planet and moon transit the central chord of the star. These assumptions 
result in substantial simplification. As both orbits are circular (e m = and 
e p = 0), equations (8.9) and (8.10) simplify to r m = a m and r p = a p , that is, 



both r m and r p are constant. Similarly, equations (8.12) and (8.14) also sim- 
plify to f m = n m t + /m(0), that is, the planet-moon pair progresses around 
its orbit with constant angular velocity. Also, we assume that the planet 
and the moon's orbit are aligned with the line-of-sight, that is, I p = tt/2, 
Im = tt/2 and fi m = Q, p (see figure 8.6). Now we are in a position to use 
the coordinate system discussed in section (8^2] and the method described in 



section 8.3 to investigate Ar for this special case. 



The first stage in deriving Ar is to determine ti np , t 



1 in. my ^eg,P t e g^ m , 



the times of ingress and egress for the transit of the planet and moon. We 
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begin with the equations describing x 



y p and y m , equations (8.15) 



to (8.18). Using the simplifications described above, and ignoring the y 



components as they are identically zero, the position of the planet and moon 
along the chord on which they are transiting is given by 



a p cos(f p + ojp) 



M m + M, 



-a m cos(n m t + fm(0)+ujm), (8.37) 



Xm = a p cos(/ p + Up) + 



Mr, 



M m + M p av ' 



cos(n m t + / m (0) + u m ), (8.38) 



where x p , x m , lo p and oj m are defined in figure 8.6 and where the first term 



represents the motion across the face of the star due to the motion of the 
planet-moon barycenter, and the second term represents the motion of the 
planet and moon about the planet-moon barycenter. For the case where the 
orbital period of the planet is much longer than the transit, the motion of the 
planet can be accurately approximated by uniform motion. Recalling that 
we do not expect short period planets to host large moons and expanding 
the first term of equations (8.37) and (8.38) about t = jT p + to, the central 
time the j th planetary transit would have occurred if there were no moon, 
gives 



Vtr(t- (jT p + t )) 



Vtr(t ~ {jTp + t )) + 



M n 



M m + M, 



M n 



M m + M, 



-a m cos(n m t + f m (0) + u m ), (8.39) 
■a m cos(n m t + f m (0) + u m ), (8.40) 



where vt r = a p n p is the velocity of the planet-moon barycenter across the 
face of the star. 



An alternative way of viewing equations (8.39) and (8.40) is that they 
implicitly define t for a given x p or x m - In particular, this equation defines 
the ingress and egress times of the planet and moon when the values of x p 



and x m on the left hand side of equations (8.39) and (8.40), correspond to 
the limb of the star. 

The position of the limb of the star for this transit geometry is given by 



Xp 
Xm 



±Rs. 



(8.41) 
(8.42) 



Consequently, the equations describing the ingress and egress times of the 
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planet and moon's transit can be denned implicitly through 



— R s — Vtr{tin,p — jT p — to) 



a m M m 
M m + M p 



COS (jlrn^in,p 

+ f m (0) + u; m ), (8.43) 



Rs — Vtr(tin,m jTp to) 



+ 



a m Mp 
M m + M. 



+ f m (0)+u m ), (8.44) 



Rs — Vtr{t e g,p — jT p — to) 



(l rn ]\ / I rn 

M m + M p 



COs{jl m teg,p + / m (0)+o; m ), (8.45) 



R s = v tr (t, 



eg,m 



jTp - t ) 



+ 



a m Mp 



+ f m (0)+u m ), (8.46) 



where tj njP and tj njm are the ingress times for the planet and moon and 
where t e9iP and t e9 ,m are the corresponding egress times. 



The argument of the cosine function in equations (8.43), (8.44), (8.45) 



and (8.46) is a measure of the position of the moon around its orbit during 
planetary ingress, moon ingress, planetary egress and moon egress respec- 
tively. Setting 9 = f rn +u) rn +ir/2, we have that 9 in>p = n m t in ^+ f m (0)+u m + 

@eg,p — 17"mteg,p /m(0) ~\~^m ~\~ 9in,m — Turrit in, m /m(0) ~l~ ^ /2 

and 6, 

into the above equations and rearranging gives 



eg,m = n m t egtrn + f m (0) + io m + 7r/2. Substituting these expressions 



/m(0) + UJ r , 



7T 



7T 



+ n m (jT p + 1 ) 



/ m (0) + uj m + - + n m (jT p + t ) 



7T 



/ m (0) + w m + - + n m (jT p + t ) + 



7T 



/ m (0) + w m + - + n m (jT p + t ) + 



TlmRs 
Vtr 


%n,p 


sm(6in :P ), 

Vtr 


(8.47) 


TlmRs 
Vtr 


%n,m 


i Vm ■ la \ 

H sm.{9 inm ), 

Vtr 


(8.48) 


'TI'mRs 
Vtr 


@eg,p 


^ sin(0 esp ), 

Vtr 


(8.49) 


'TI'mRs 
Vtr 


9eg,m 


i Vm ■ la \ 

H sm(d egm ), 

Vtr 


(8.50) 
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where v p and v m , the velocity of the planet and moon about the planet-moon 
barycenter are defined as 



M„ 



pin 



M„ 



.1/ 



(8.51) 
(8.52) 



prn 



where M pm = M v + M„ 



Equations (8.47), (8.49), (8.48), and (8.50) can be written as 



$ = CC + B sin( 



(8.53) 



where the subscript "cc" implies that the orbits are circular and coplanar, 
where $ and B are known constants, and where B is the ratio of the ve- 
locity of the planet or moon around their barycenter to the velocity of the 
barycenter around the star. For reference, the $ and B corresponding to 



each of the four equations are given in table 8.1 



Now is a good time to pause and take stock. Equation (8.53) exactly 
describes 9 CC in terms of $ and B for all values of and B. While we could 



solve equation (8.53) numerically for a grid of $ and B values represent- 
ing all the values of R s , n m , / m (0) + oj m and vt r of interest, this approach 
is not optimal for three reasons. First, a numerical approach may lead to 
canceling errors which would not occur if a more analytic approach were 
employed. For example, consider the dependance of the detection threshold 
on A p +A m , a quantity dominated by the size of the planet. As will be found 
in sections 8.4.1.1 and 9.2.1 both At and €j are inversely proportional to 
A p + A m . Consequently, when the ratio of the amplitude of At and e,- is 
formed to determine the detection threshold, as will be done in chapter [TO], 
it should be independent of A p + A m , and thus planetary radius. However, 
if a numerical approach was employed, we would find that At was approx- 
imately proportional to A p + A m and consequently that the ratio between 
At and €j may not be independent of A p + A m . As the difference between 
a slight dependance of At on A p + A m and no dependance is scientifically 
interesting, the analytic approach is preferred. Second, the expressions de- 
rived to evaluate At assume that either v m /vt r is small, or R s /a m is small. 
There is little point obtaining precise numerical values for CC if they are to 
be used in an approximate expression, and consequently will not yield more 
precise values of At. Third, for the case where the moon orbit is no longer 



circular and coplanar, the form of equation (8.53) and the form of $ and B 



will change. Consequently a new numerical grid will have to be evaluated 
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X 


$x 




in,p 
in, m 

eg,p 
eg, m 


/ m (0) + uj m + § + n m (jT p + t ) - ^ 
/m(0) + 0J m + f + n m (jT p + t ) - ^ 
/ m (0) + w m + § + n m (jT p + t ) + ^ 
/ m (0) + w m + § + n m (jT p + t ) + ^ 


Up 

Vtr 
Vm 

Vtr 
V p 

Vtr 
Vm 
Vtr 



Table 8.1: The values of and B corresponding to equations (8.47) to 
([8T501). 



for each new moon orbit. Alternatively, as demonstrated in appendix |IJ an 
analytic method can be extended to investigate different types of moon or- 
bits without evaluating numerical grids of $ and B values. As a result of 
these reasons, the approach that is used, is to approximate the solution of 



equation (8.53) using an analytic expansion. In particular, the expansion 



that was been selected is most accurate for small values of B, and can be 
extended to other types of moon orbits. 



We begin by noting that equation (8.53) is mathematically equivalent to 
Kepler's equation 

M = E + esinE, (8.54) 

where M, the mean anomaly, is equivalent to and E, the eccentric 
anomaly, is equivalent to 9 CC and e, the orbital eccentricity, is equivalent 
to B. As equation (8.54) can be expanded to give an explicit expression 
for E in terms of M and e, equation (8.53) can be expanded to give an 
expression for 9 CC in terms of and B. 



Following Murray & Dermott ( 1999 , p39) and writing sin £ as a Fourier 



series in M, equation (8.54) can be written in terms of Bessel functions, as 
follows 



E = M + Y^ T J k (ke) shx(nM), 
k 



k=l 



.55) 



where Jk(x) is a Bessel function of the first kind (e.g. 
p. 150), defined as 

-iy(x/2) k+2 i 



Spiegel fc Liu, 1999 



Jk(x) = 



3=0 



j\T(k + j + l) 



.56) 



where T is the gamma function. For reference, expansions to order x 5 for 
Bessel functions with k = 1 ... 5 are presented in table 8.2 While the 



solution of equation (8.54) is true for all values of M and e, the expansion 
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Bessel Function Expansion to 0(x 5 ) 

Mx) - ±x s + ^x 5 + 0(x 7 ) 

J 2 (x) \x 2 - + 0(x 6 ) 

■M*) ±x 3 -±.x 5 + <D(x 7 ) 

J 5 (x) ^^s + o^ 7 ) 

Table 8.2: List of the Taylor expansion of the first five Bessel functions up 
to order x 5 . 



given by equation (8.55) only converges for e < 0.6627 (Hagihara, 1970, p. 
510) and can consequently only be used when e < 0.6627. By analogy, we 
also have that 

oo 

cc = <S> + Y.i: J k{kB)sin{k<5>), (8.57) 



k 

k=l 



for cases where \B\ < 0.6627. 



There are two possible issues with using equation (8.57) in its present 



form to derive ti n>p , t egtP , ti n ^ m and t e g,m & n d thus At. The first issue is 
that the expansion may fail, that is, that \B\ > 0.6627. The second issue is 
that the expression, in particular, the infinite sum, is functionally complex. 
Fortunately, both these problems are simply resolved as, first, excluding 
the region in which the expansion fails does not exclude many physically 
realistic moons, and second that most of the terms in the infinite sum can 
(and will) be neglected as they do not substantially increase the accuracy of 
the calculated value of At. These issues will be discussed in turn. 

As \B\ can correspond to both v m /v tr and v p /vt r , and v m > v p (as 
M m < M p ), the requirement that \B\ < 0.6627 corresponds to the limit 
of v m /vtr < 0.6627. Fortunately, this limit does not strongly restrict the 
range of detectable moons, for example, for the Jupiter-Callisto moon system 
were at 1AU and transiting a Sun-like star, the velocity ratio would be 
approximately 0.28. The regions where this expansion fails are shown in 



figure [8/j] for the cases where M p /M s = 10~ 2 , M p /M s = 10~ 3 and M p /M s = 
10 -4 . As can be seen, the motion of the moon is well described by this 
expansion for most of the range of planet masses and radii which are likely 
to be detected. 

In addition to the region where the expansion fails, there is also the 
region where the assumption that the planet and moon have constant veloc- 
ities during transit leads to substantial inaccuracies in the calculated value 
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Figure 8.7: Diagram showing the position of the three body instability 
boundary (dashed line) and the boundary of the region where the ex- 



pansion given by equation (8.57) fails (solid line), for three values of the 
planet star mass ratio. For comparison, for the case of M p /M s = 10 -3 and 
M, p /M s = 10~ 4 , the semi-major axis of the regular satellites of Jupiter and 
Uranus respectively are also shown (grey lines). 
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of At. As mentioned in section 8.3.3 the assumption of constant velocities 



is most accurate for planet moon pairs with small values of v m /vt r (that is, 
B) or a m /R s . In addition, as also mentioned in section 8.3.3, as it is difficult 
to detect moons with small values of a m /R s , B is a good indicator of when 
the assumption of uniform velocities is likely to fail. So for the case where B 
is small, only the lowest order terms in B are required (as B is small), and 
for the case where B is not small, additional terms do not substantially in- 
crease the accuracy in the final calculated value of At as the approximation 
of constant velocities becomes increasingly inaccurate (see figure 8.8 in sec- 
tion 8.4.1.2). As a result, the infinite sum can be approximated by the first 



couple of terms, and the Bessel functions can be approximated by the first 
couple of terms in their Tailor expansion. 6 In particular, in section 8.4.1 



the section where expressions for At are calculated, At will be calculated 



to first order (section 8.4.1.1) and second order (section 8.4.1.2) in B. 

1 in. in and t t 



Continuing, in order to derive expressions for ti UjP , t eg ^ p , U 



and consequently At, equation (8.57), the expression for 9, 



eg, mi 

in terms of 

$ and B must be reformatted to give expressions for tin :P , t eg ^ p , ti n ^ m and 

the ingress and 



t 



eg,m- 



Using equation (8.57), and the values in table 



8.1 



egress times of the moon can be written in terms of the planet and moon 
masses and orbital parameters. This gives 



t 



jT p + 1 



R 12 

— H ^2 T, J k{kB in ^ p ) sm(k®in, P ), 

k=l 

R 1 °° 2 

tin,m = jTp + to I / , T^k\kBi n m ) sin(K < I ) j n m ) 

R 1 °° 2 
t eg ,p =jT p + t -\ H V, -J k (kB eg , p ) sm(k$ eg , p ), 



t 



Vtr 



R.i 



fc=i 

oo 



eg,m 



jT p + tQ + — + 



Vtr 



n 



eg,m 



eg,mj: 



(8.58) 
(8.59) 
(8.60) 
(8.61) 



fc=i 



where we have left the $ and B terms in the sum for readability. The form of 



equations (8.58) to (8.61) is reassuring in that it is exactly what we would 



expect. For example, equation (8.60) indicates that the planet's time of 



egress is the sum of the the time we would expect the planet to reach the 
middle of the stellar disk (jT p + to), the time it would take a lone planet 



6 For example, to first order in B, only the k = 1 term in equation (8.571 contributes, 
so only it has be be retained. For the case where equation (8.571 is taken to order B 2 only 
the k = 1 and k = 2 terms contribute. 
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to travel from the center to the limb of the star (R s /vt r ), and an additional 
modifying term (the infinite sum), indicating the effect of the moon. 



Now that expressions for £j n>p , t eg ^ p , ti n , r ,i 



and t eq m have been determined, 



equations (8.19), (8.20), (8.30) and (8.31) from section 8.3.2 can be used to 



,n, A p and A m , and consequently derive At. For 



write expressions for r p , r, 
simplicity, these expressions will be calculated in the following section, after 
the equations have been reduced to the appropriate order. 



8.4.1 Form of A 



T 



can be derived, At will be 
investigated. As equations (|8.58l) to (|8.61l) are quite complex, in order to 



Now that expressions for r p , r m , A p and A r , 



build mathematical intuition, At will be investigated for two cases. First, 
At will be investigated for the case where motion during transit is negligible. 
Second, At will be investigated for the case where motion during transit is 
non-negligible, but where the largest ratio of the velocities, v m /vt r is less 
that 0.66. These results will then be combined to provide a qualitative 
description of the behaviour produced. 



8.4.1.1 Case where v m /vt r <C 1 



Expanding equations (8.58), (8.59), (8.60) and (8.61) to first order in v p /vt r 
or v m /vtr gives 



jT p + to 



Rs 

Vtr 



1 V P ■ ft t(\\ i L _i / -rp , + N n m R ^ 

sm / m (0) + Lu m + - + n m (jT p + t ) 

n m V tr V 2 v tr 



1.62) 



jT p + t 



Rs 

Vtr 



H sm / m (0) + Lo m + - + n m (jT p + t ) 

n m V tr V 2 V tr 



5.63) 



J'T„ + 1 + 



1 V P ■ ft tn\ , i i ,™ , n , 
sin Jm{0) + w m + - + n m (jT p + t ) H 

n m v tr V 2 t; ir 



.64) 
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t 



eg,m 



jT p + t + 



Rs 
Vtr 



1 V m 

fl"m Vtr 



7T 

-! sin ( / m (0) + u m + - + n m (jT p + t ) + 



Vtr 



(8.65) 



Then, substituting equations (8.58) to (8.61) into equations (8.19), (8.20), 
(8.30) and (8.31) and retaining terms to order v p /vt r or v m /vt r gives 



t p = jT p + t 



1 v P 

Vtr 



COS 



T m = jTp + t + 



f^m Vtr 



COS 



f^mRs 
Vtr 

^mRs 
Vtr 

Tim R$ 



, _ I 1 V p Vtr * . 

Jip — Zip /ip sm 

n m U( r i? s V v t 

. i 1 «m «*r i - / n m R s 

A-m — Am + j-, A m Sin I 

n m V tr R S V u tr 



x cos (fm(t ) +io m + n m jT p ) , (8.66) 

cos (fm{t ) + uj m + n m jTp) , (8.67) 

sin (f m (t ) + uj m + n m jT p ) , (8.68) 

sin (f m (to) + w m + n m jT p ) , (8.69) 



where we have used the identity that f m (to) = /m(0) + ^m^o- 

Substituting this into equation (7.59) and neglecting all terms of order 
Vp/vt r , v m v p /vf r or v^/Vfr and above gives 



A m Mp 

At = 



ApM m ct m 



Ap m Mp m 



cos 



Vtr 



V tr 



x cos (f m (to) + uj m + n m jT p ) , (8.70) 



where we define M pm = M p + M m and A pm = A p + ^4 m • 

Consequently, for the case of negligible motion of the planet and moon 
during the transit, At is given by equation ( |8.70 ). This equation can be 
further simplified by investigating the quantity (A p M m — A m M p ) / A pm Mp m . 
We begin by noting that A m /A p « R^/Rp, M p = 4Tr/3RpP p and M m = 
j"iR m p m . Consequently the ratio of the size of the first term in the nu- 
merator to the second is equal to 



A rn Af p 
A p M m 



Rp Pp 

Rm Pm 



(8.71) 



For the case where the planet is dominated by solids, for example the terres- 
trial planets, the ratio p p j p m is likely to be close to one and the ratio R p /R m 
is likely to be large. Consequently the term A m M p will be much larger than 
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A p M m . For the case where the planet is dominated by gas, for example, 
the gas giants, the ratio R p /R m will be very large, so while the ratio of the 
densities could be less than one, A m M p will again dominate A p M m . 

As a result, the term A p M m can be neglected in the numerator, to give 
the approximation 



A7 



A,. 



-M p d m 



A p m M pm v tr 



COS 



TlmRt 
Vtr 



cos(/ m (t ) + LO rn + jn m T p )). (8.72) 



As an aside, for the case where Apm « A p the neglected term corresponds 
to At p , the barycentric transit timing perturbation. Consequently, we have 
that At > At p . 



8.4.1.2 Case where v m /vt r < 0.66 

The expressions for At derived above are only correct to first order in v m /vt r 
and v p /vtr- To explore and quantify the error caused by truncating to this 
order, At will be calculated to second order in v m /v t r and v p /vt r and com- 
pared to a full simulation. 



Expanding equations (8.58), (8.59), constructing expressions for T p , T m , 



A p and A m and consequently At, and only retaining terms of order (v m /vt r 
v m Vplv1 r and (vp/v tr ) 2 gives 



At 



plVl m U> m 



Ap m Mp m 



cos 



V tr 



ApAf rn -\- AmAdp CL m Tl m CL m 



Vtr 



A M 2 

J-L pm lVl prn 



Vtr V tr 

R 



1 A m Ap CL m u m . ; /';,).'. i. s 

sm 2 



4 A 2 v tr R 



v t , 



cos (f m (t ) + uj m + jn m T p 



cos ( 2 



. n m R> 



sin (2 (f m (t ) + uj m + jn m T p )) 



sin (2(/ m (i ) + w m + jn m T p )) . (8.73) 



As can be seen, the addition of the higher order terms results in higher 
order harmonics being included in the expression for At. Physically these 



additional terms make At asymmetric (see figure 8.8) 



To determine the increase in accuracy in At, equations (8.70) and (8.73) 



were compared to At values directly calculated from a set of simulated light 
curves. For comparison purposes, it was decided to select scenarios which 



included those shown in figure 8.5 Consequently the cases where v m /vt r was 
equal to 0.16, 0.33, 0.49 and 0.66 and a m was equal to 0.5i? s , R s and 2R S 
were selected. These curves were calculated assuming that R p = 0.1R S and 
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Rm = O.OlRg and are shown in figure 8.8 This figure shows the degree of 



agreement between the two analytic approximations for At (blue and red) 
and the true value (black) as a function of the angle (/ m (*o) + w m + jn m T p ). 



As can be seen, equation (8.70), the first order approximation, reproduces 



the broad behaviour of At, in particular its amplitude, while equation (8.73 ), 



the second order equation is more successful at reproducing the finer detail 



see in particular figure 8.8(h)). Reassuringly, for the regions where the 



assumption of uniform velocities is an acceptable one (figures 8.8(a) , 8.8(b) 
and |8.8(h)| ), both approximations perform well, and for the scenario where 
the assumption of uniform velocities is not as effective (see figures 8.8(c) and 



8.5(c) ) the two expressions do not agree very well with the exact waveform. 
Consequently, for the cases where the assumptions that the planet and moon 
travel with constant velocity during transit and that v m /vt r < 0.66 hold, 



equation (8.70), the first order equation, gives a good qualitative description 



of At and equation (8.73) gives a more accurate description. 



Now that we have two physically realistic approximations for At, the 
properties of At can now be investigated. In particular, we can begin to 
consider how the form of these equations (and the physics of the system) can 
affect the amount of perturbation timing signal a given moon can produce 
and the amount of this signal which can be detected. 



8.4.2 Properties of At 

Now that we have expressions for At, it is a good time to take a step back 
and consider what these expressions tell us about the system. Recall from 



section 7.4.5 that that an observer cannot measure At directly, and can only 
measure a sequence of r values 



n,T 2 , ... ,tjv, 



.74) 



corresponding to the N measured transits. Consequently we will discuss 
the form of At for the case of circular coplanar orbits in two different con- 
texts. First we will look at the properties of At in isolation, with particular 
reference to how the amplitude depends on the physical parameters of the 
system, and how the form of At does not allow differentiation between pro- 
grade and retrograde orbits. Then we will discuss At within the context of 



being part of a signal train such as the one given by equation (8.74) 



8.4.2.1 Properties of the amplitude of At 

Intuitively it can be seen that the larger the amplitude of At, the more 
detectable the perturbation in a sequence of r values. As we now have 



165 



Angular Position of Moon (Rad) 



Angular Position of Moon (Had) 



Angular Position of Moon (Rad) 
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(g) a m =2R s , v m /v tr =0-33. (h) a m =R s , v m /v tr =Q-33. (i) a m =0.5.R s , w m /v tr =0.33. 
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(j) a m =2R 3 , v m /v tr =0.16. (k) a m =R s , v m /v tr =0.l6. (1) a m =0.5R s , v m /u tr =0.16. 



Figure 8.8: Comparison of the value of At calculated directly from simulated 
transit light curves (black), with that of equations ( 8.70| ) and (8.73), the 
analytic approximations to At accurate to first (blue) and second (red) 
order in velocity ratio. As the degree of agreement of the curves is more 
important then their exact value, and to reduce the number of independent 
variables, Arn m is plotted against the angle (/ m (io) + + jn m T p ). These 
plots were constructed for the case of a large gas giant moon, in particular, 

^ = 0MR s . 
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it was assumed that R p = 0.1 R s and R 



a number of approximations to At given by equations (8.70), (8.72) and 



(8.73), we consequently can start to look at the relationship between this 
amplitude, and the size, mass and semi-major axis of the moon. To pro- 
vide the maximum mathematical intuition with the least amount of math- 
ematical complexity, equation (8.72), the simplest equation for At, will be 



investigated. 

Recasting this equation into physical variables and grouping like terms 
in square brackets gives 



At « 0.01 



2R S 

Vtr 



2R K 



M v + M„ 



cos 



W0A m 

Ap -\- A m 

Vm Rg 



cos(/ m (t ) + w m + jn m T p )). (8.75) 



There are three terms in this equation which substantially affect the ampli- 
tude of At (on the first line) and two that don't as they are approximately 
unity (on the second line). These five terms in the amplitude will be dis- 
cussed in turn. 

The first term in square brackets is the length of the transit duration 
(see equation (7.18)). This indicates that the longer the transit duration 



(i.e. the more distant the planet-moon pair is from the star), the larger 
the amplitude of At. Also, as all other terms in the amplitude are either 
fractions of like quantities, or functions of fractions of like quantities, it is 
this term that give At the units of time. 

The second term in the square brackets is the ratio between the size of 
the moon's orbital semi-major axis and a characteristic scale size of the star, 
in this case, the diameter of the star. As can be seen, At scales with a m 
and consequently more distant (a m > R s ) moons have larger At amplitudes 
than similar moons with a smaller semi-major axis. 

The third term is a ratio between the area of the dip caused by a moon 
and the total dip area corresponding to both planet and moon, and has 
been scaled such that a Earth-like moon of a Jupiter-like planet would ren- 
der this term approximately equal to one. As dip depth is approximately 
proportional to radius of the body squared, and as planets are generally 
much larger than moons, this term can be though of a comparison between 
Rm and Rp. 

As mentioned previously, the remaining two terms do not substantially 
effect At as they are both approximately equal to one. However, they will 
be discussed for completeness. 
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The fraction M v j (M p + M m ) describes the proportion of the mass in the 
planet-moon system that is taken up by the planet. As this proportion will 
differ from unity by at most 0.01% and 4% for the case of disk generated 
and impact generated moons respectively (see section 3.3) it can be safely 
neglected. 

The final amplitude term is approximately equal to one for the case where 
moons are detectable, and the expansion is accurate. To see this, recall from 
section 8.3.3 that in order for the assumption of uniform velocities to be 
accurate, either v m /vt r or R s /a m must be small. In addition, the amplitude 
of Ar is proportional to a m /R s , so detectable moons should have appreciable 
values of a m /R s (and thus small values of R s /a m ). As a result of these two 
two factors, the argument of the cosine function is likely to be small. Thus, 
as cos# ~ 1 for 6 small, this term is approximately 1. 

As discussed, the amplitude of Ar depends linearly on the transit du- 
ration and the semi-major axis of the moon, suggesting that more distant 
planets with more distant moons have higher Ar amplitudes. In addition 
Ar is also proportional to A m /(A P + A m ) « R^/Rp, but not M m or M p . 
This suggests that it is the physical size of the planet and the moon which 
affects the amplitude of Ar, and not their masses. Now that the properties 
of the amplitude of Ar signal have been discussed some more properties of 
the signal will be described, in particular the degeneracy in Ar with respect 
to prograde and retrograde orbits. 



8.4.2.2 Properties of the form of Ar 



Ar is a function of the projected position of the planet and moon on the 
plane of the sky, and to a lesser extent their velocities in that plane. How- 
ever, there are two orbits with the same positions and velocities across the 
plane of the sky as a function of time (see figure 8.9). One corresponds to 
a prograde orbit with an initial position angle of f m (to) an d argument of 
perihelion of w m , while the second corresponds to a retrograde orbit with ini- 
tial position angle — f m (to) and argument of perihelion of — ui m . To see this 
mathematically, replace f m (to) with —f m (to), uJ m with —io m and n m with 
in any of equations ( 8.70| ), 



72) and (8.73) and note that the form 



of Ar remains the same. Consequently Ar cannot be used to distinguish 
between prograde and retrograde orbits. This is unfortunate as different 
formation mechanisms predict different types of orbital structures, e.g. cap- 
tured moons are likely to be retrograde, while regular moons are likely to be 
prograde (see section 3.3). Thus TTV P (and all other transit-based moon 



detection techniques) cannot be used to investigate moon properties with 
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Figure 8.9: An example of a prograde (left) and retrograde (right) system 
that will show the same silhouette (middle) during transit, and thus have 
the same At values. The two systems are shown from above, such that 
the yellow hemisphere and the eye represent the star and the observer, and 
the vertical sequence of diagrams show the relative planet-moon orientation 
corresponding to a sequence of four consecutive transits. The silhouettes 
are shown from the point of view of the observer, where the large and small 
black dots mark the mid-transit position of the planet and moon respectively 
and an arrow is used to show the direction and magnitude of the transverse 
orbital velocity of the moon in its orbit about the planet. 



respect to whether orbits are prograde or retrograde. 

This result is very robust in that inclined or eccentric orbits suffer from 
the same issue 7 and that mutual events (moon passing in front or behind 
planet) do not help differentiate between the cases. Only a direct mea- 
sure the effect of the moon on the planet's light curve or spectra (see sec- 
tion 4.3.5), can break the symmetry and differentiate between prograde and 
retrograde orbits. 

This symmetry leads to one more interesting property, At depends on 
a m alone, and not the sign of n m . Consequently any detection plot produced 



7 In these cases prograde orbits pair up with the retrograde orbits which have been 
reflected across the plane of the sky. 
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with a m as one of the axes can represent both prograde and retrograde orbits. 



8.4.2.3 Properties of the signal containing Ar 

Now that the properties of At have been discussed, we can look at the 
"size" of the detectable component of Ar within the context of a signal, 



such as the one given by equation (8.74). As the TTV P detection process 
involves fitting a linear (to + jT p ) and a quasi-sinusoidal function (Ar) of j, 
the transit number, to the r values, the pertinent amplitude is not actually 
the amplitude of Ar, but the amplitude of Ar once linear trends have been 
removed, that is, the amplitude of the perturbation that would be seen in the 
corresponding O-C diagram. This process leads to a number of behaviours, 
and in particular, we will concentrate on two of these. First, as Ar is aliased, 
moons with the same aliased orbital frequency will display similar behaviour 
in terms of the size of the detectable component of Ar in their corresponding 
sequence of r values. Second, moons which complete an integer number of 
orbits each planetary year will not be detectable. These "non-detection 
spikes" are due to the fact that the position of the moon relative to the 
planet during transit will be the same for every transit, consequently Ar 
will be the same for each transit, and thus be absorbed into the fitting 
parameter to. These aspects will be discussed in turn. 

As a result of stability constraints, 8 moons complete many orbits of their 
host planet during a single planetary year (Kipping, 2009a). Consequently, 
Ar also goes through many cycles between one transit and the next, in other 
words it is aliased, and thus there are a quantised set of angular frequencies 
which will produce the same series of snapshots and consequently produce 
similar values of Ar. To see this, consider the parameter that defines the 
angle that a moon has progressed around its orbit from one transit to the 
next, the number of months per planetary year, n m /n p , and in particular, 
the fractional part of this quantity. Consequently, if n m were increased by 
n p , such that the fraction n m /n p increased by one, then the snapshot of the 
position of the planet and moon observed at each transit would still be the 
same. Mathematically, this can be seen by replacing n m with n m + kn p , 



where k is an integer, in equation (8.70), the equation for the case where 
v m /vtr is small, and noting that the equation remains unchanged. 9 



8 For the case where a m — O.HRh, an estimate for the orbit with the lowest value of n m , 
it can be shown that n m — y/24n p . Consequently, n m > n p , and T m the orbital period of 
the moon is always smaller than T p , the orbital period of the planet. 

9 For larger values of v m /vt r , where motion of the planet and moon during transit is 



non-negligible, such as the situations described by equation (8.731, Ar will change when 
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As we will investigate how moon detectability changes with semi-major 



axis a m in sections 10.4 and 10.6 it would be useful to recast this result in 
terms of a m as opposed to n m . The angular frequency n m is related to the 
semi-major axis through 

2 GjMrn + M p ) 
n m = 3 • 8-76 

Using implicit differentiation to take the derivative of this equation with 
respect to n m gives 



G(M m + M p ) da m r „ 77 s 
2n m = -3 _ _ . (8.77) 



Now, consider Aa m , the change in moon semi-major axis, which occurs 
when n m is increased by n p . As the relative change in a m and n m moving 
from an orbit with mean motion n m to one with mean motion n m + n p is 
small (i.e n p <C n m and Aa m <C a m ), we can replace the derivatives with As. 
Doing this, noting that An m , the change in mean motion between similar 
orbits is given by n p , and rearranging to give Aa m as a function of n m and 
n v gives 

Aa m = -nmain P3G{M * + Mp) . (8.78) 



From equation (8.76) we have that n m oc a m . Combining this with equa- 
tion (8.78) and noting that n p is constant, we have that Aa m oc am ■ Now 
that the general case has been discussed, we will look at a special set of 
orbits which result in undetectable moons even though At ^ 0. 

For the case where n m = k27r/T p , where k is an integer, that is, the 
moon completes an integer number of orbits each planetary year, the moon 
will not be detectable. This is because the same segment of the moon's orbit 
will be sampled each transit, and consequently no periodic perturbation in 
r will be observed. The resulting structure on a plot of the amplitude of the 
detectable portion of At vs. a m is a non-detection spike, i.e. the function 
will go to zero for these particular orbits. In particular, the relative width 
of these dips can be estimated. 

To do this, consider the case where a moon has initial position angle 
fm(to), and orbital angular frequency n m = k2ir /T p + An m , where An m 
is very much smaller than 2tt. The moon will progresses around its orbit 
by an angle of An m from transit to transit and the corresponding orbit is 



n m is incremented by n p , but only slightly. 
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Figure 8.10: Schematic diagram showing four transits for five planet-moon 
systems with slightly different moon semi-major axes. The different planet- 
moon systems are arranged from left to right. For each planet moon system 
the relative orientation of the planet and moon during transit is shown in 
the vertical column at the top of the diagram with the first transit being at 
the top and the last transit at the bottom, where the position of the star 
and the observer is given by the grey semi-circle and the eye respectively. 
The corresponding phase-wrapped sequence of At values (crosses) is then 
shown in the small plot below. Finally, the amplitude of Ar (once linear 
trends have been removed) is shown in the lower plot by a thick black line, 
while the values for each of the five systems are shown by grey dots. 
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An m away from the center of a spike. If the in-transit position angle of the 
moon only samples a region of Ar which is well approximated by a straight 
line, it doesn't matter how many measurements are taken, or the accuracy 
of these measurements, the moon will not be detectable, and the orbit will 
correspond to a position inside the spike (see figure 



8.10). 



For the inner three systems shown in figure 8.10 the region of Ar sam- 
pled by the four observed transits can either be well described by a straight 
line or a by point, resulting in the perturbation Ar being undetectable for 
this small, but non-zero, range of a m . To begin, let the angle over which 
At is well approximated by a straight line near the angle / m (io) be Af m 
where we note that Af m is a function of / m (io)- If the difference between 
the position angle of the moon during the first transit and the position angle 
of the moon during the last transit is larger than this threshold value, then 
the moon will be detectable, while if it is smaller than the threshold value, 
then the moon will be undetectable and lie in the spike. 

Mathematically, the An n value which corresponds to the edge of the 
spike is given by 

(/m(*o) + (N- l)An m T p ) - f m (t ) = (N- l)An m T p = Af m . (8.79) 



Rearranging equation (8.79), in terms of An m gives 



An m = - -fa . . (8.80) 
T p (N - 1) y ' 

It can be seen that the width of these regions depends on f m (to) through the 
Af m term, and that it is inversely proportional to (N — 1) and independent 
of n m . As discussed in the previous section, small intervals with width 
that are independent of n m have width proportional to a^ 5 when written in 
terms of a m . Consequently, for large N, the core of these dips have width 
proportional to N~ l a^ . 

8.4.2.4 Summary of properties 

Bringing all this work together, we can now summarise the expected proper- 



ties of the detectable portion of Ar as a function of a m . From section 8.4.2.1 



we have that gross behaviour of this function is linear in a m , however from 



section 8.4.2.3 we know that fine level structure is also present in this func- 
tion. Due to aliasing the function is comprised of discrete, repeating blocks, 
with length proportional to a^ 5 . Also, at the start and end of these blocks 
are non-detection spikes with width proportional to A r_1 a^ 5 , where the 
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Figure 8.11: Sketch of the size of At, once linear trends have been removed, 
as a function of moon semi-major axis. For reference, a dashed line propor- 
tional to a m is also shown. 



proportionality constant depends on f m (to). To represent this behaviour, a 
model diagram was constructed showing the "amount" of At that could be 
detected, for the case of a moon with known (and constant) mass and size 
but with varying semi-major axis (see figure |8TT| ) . Now that the case of 
circular coplanar orbits has been fully explored, let us expand our analysis 
to the case of inclined orbits. 



8.5 Circular planet orbit inclined to the line-of- 
sight 

For the case where the orbit of the planet is still circular, but slightly inclined 
with respect to the line-of-sight, the planet and moon no longer transit the 



central chord of their host star (see figure 8.12). As for the case of circular 
and coplanar orbits, we begin the process of deriving At by considering and 
simplifying the equations of motion of the planet and moon. For the case 



174 




Figure 8.12: Schematic diagram of the same form as figure 8.4 of the coor- 
dinate system for the case of a circular but slightly inclined planet orbit. In 
particular, it is assumed that J m = tt/2 and m = U T 



up. 



and r p = a p . 



of a circular, slightly inclined planet orbit we have that e p 
Similarly, for a circular moon orbit aligned to the line-of-sight 10 we have 
that I m = tt/2, Q, p = Q m , e m = 0, r m = a m and f m = n m t + f m (0). Using 
the expressions given above, equations (8.15) to (8.18) simplify to 

M m 



a p cos(/ p + uj pJ - — 
a p cos(/ p + Up) + 



prn 

M p 



M, 



a m cos(n m t + f m {0) + w„ 
a m cos(n m t + f m (0) + oj n 



prn 



a p cos Qp cos I p sin(/ p + u p ), 
a p cos fi p cos I p sin(/ p + uj p ) , 



(8.81) 

(8.82) 
(8.83) 



where we note that the y-components are constant to order R s /a p and can 
be safely subsequently neglected. 11 



10 Recall from section [8.2. l| that the case where the orbit of the moon is aligned with 
the line-of-sight is indi sting uishable from the case of circular coplanar orbits. 



Recall from section 



7.2 



that the value of f p corresponding to the planetary transit mid- 
time is approximately given by tt/2— uj p . Performing a Taylor expansion of equations ( 8.83 1 
and (8.841 about f p = -n/2 — uj p , gives y p — y m = a v cosf2 p cos/ p (l — l/2(/ p — (tt/2 — lu p )) ). 



Again recalling from section (7.2 l that the change in f p during transit is of order R 3 /a p , 
we have that y p = y m = a P cos f2 p cos I p to first order in R s /a p . 
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Again performing a Taylor expansion of the first terms in equations (8.81 ) 
and (8.82) about the time that the j th transit would have occurred if there 



had been no moon, gives 

M m 

Xp = v tr (t - (jT p + t )) - — — a m cos(n m t + f m (0) + u m ), (8.85) 

x m = v tr (t - (jT p + t )) + -^-a m cos(n m t + f m (0) + u m ). (8.86) 

Mpm 

These expressions for the position of the planet and moon on the face of 
the star can again be used to determine the time of ingress and egress by 
equating the left hand side of the equations to the x-coordinate of the limb 
of the star. 

For the case of an inclined planetary orbit, the x-position of the limb of 
the star is given by 



x p = ±jR 2 -6 2 min , (8.87) 



±\ m - si 



mm ' 



for the case of the planet and moon respectively, where 5 m i n is the mini- 
mum projected distance between the center of the planet and the star and 



is equal to (R 2 — a 2 cos 2 Ip) 1 ^ 2 for this type of orbit (see section 7.2). Conse- 
quently, the equations describing the ingress and egress times of t 
and moon's transit can be defined implicitly using 



le planet 



R 2 s-Sl in = Vtr(t in , p -(jT p + t )) 

M, 



Mp m 



+ f m {0)+uj m ), (8.89) 



R l ~ 6 min = Vtr(Un,m ~ U T P + *o)) 



+ f m (0) + u m ), (8.90) 



/ R 2 s-S 2 mln = Vtr(te g ,p- (jTp + to)) 

M, 



Mp m 



"ra COs(?T. m ieg,p + /m(0) + Wm), (8.91) 
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'm - si. 



Vtr(t 



eg,m 



UT P + to)) 

+ J^~ a ™ cos(n m t eg ^ m + / m (0) + w m ). (8.92) 



pin 



Again denning 9 in;P = n m t in , p + f m (0) + uj m + n/2, 9 eg , p = n m t e9iP + 

eg,m — ^mteg,m~^" 



"v. 



n m ti ni m+fm(Q)+u m +ir/2 and 6, 



/m(0) + w m + tt/2, and substituting these expressions into equations (8.89) 
to ( |8.92| ) we obtain 



+ g + + / m (0) + n m (jT p + t ) 



sin(0 inp ), (8.93) 

Vtr 



n m jRl ^min IT 



Vtr 



+ - + w m + / m (0) + n m (jT p + t ) 



#m,m + — sin(9 inm ), (8.94) 



+ - + W m + /m(0) + n m (jT p + t ) 



%n,m sin(0j nm ), 



.95) 



+ - + w m + / m (0) + n m (jT p + t ) 



9e 9 ,m + — sin(6> e9>m ). (8.96) 

V tr 



Comparing these equations to equations (8.47) to (8.50), the equivalent set 



for the case of circular coplanar orbits, it can be seen that they are of the 



same form. In particular, they can also be represented by equation (8.53), 

$ = 9 + Bsm(9). 



However, in this case, the expressions for $ are no longer given by those in 



table 8.1, but instead by those in table 8.3 
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_x 

in,p 
in, m 
eg,p 
eg, m 



/ m (0) + u m + f + n m (jT p + t ) 
/ m (0) + u; m + § + n m (jT p + t ) 
/ m (0) + w m + f + n m (jT p + t ) + 
/m(0) + u; m + § + n m (jT p + t ) + 



2 

??i in. 







■'' 777. 

Vt r 
Vp 
Vtr 

I'm 
Vt r 



Table 8.3: The values of <& and B corresponding to equations (8.93) to 
([87961). 



To solve this equation, we note that in section 8.4 an expression for 
9 was derived for the case where B < 0.6627, which was accurate for all 
values of As the values of B for this case are the same as those for the 
case of circular coplanar orbits, and only the values of $ are altered, the 
solution derived for the case of circular coplanar orbits can be directly used 
to determine 9 and thus Ar for the case of inclined orbits. 



8.5.1 Form of Ar 



To begin the process of deriving expressions for Ar, we note that equa- 
tions ( |8.47D to ( |8.50| ) and ( |8.93| ) to ( |8.96| ) only differ in their $ terms. In 
addition, as the expressions for $ for the case of circular and coplanar or- 
bits can be transformed to those for inclined orbits by replacing R s with 
— 5^ nin ) 1 ^ 2 , and R s and {R 2 S — 5 2 nin ) 1 ^ 2 do not occur anywhere else 
in the equations, the expressions for Ar can be modified to give equiva- 
lent expressions for the case of inclined orbits by simply replacing R s with 
I 1 / 2 . This will be done for the two cases considered in section 8.4.1 



(R 2 s -s 2 n 

the case where v m /vt r <C 1, and the case where v m /vt r < 0.66. 



8.5.1.1 Case where v m /vt r *C 1 



In section 8.4.1.1 two expressions were derived for the case where v m /vt r 
1, equation (8.70), a general equation, and equation (8.72), a simplified 
version of this equation. The equivalent expressions for the case of inclined 



planetary orbits are given by replacing all instances of R s with (Rj—8 2 



a/2 
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in these two equations. Doing this, the following is obtained 



At 



A m .M„ 



ApM m a Tl 



Ap m Mp m 



cos 



Vtr 



n m \/R 2 s - (a p cosJp)M 
) 

x cos (f m (t ) + uj m + n m jT p ) , (8.97) 



and 



At 



A, 



Apm M pm 



M p a m f n m jR 2 s - (a p cosI p ) 2 
— cos 



Vtr 



Vtr 



where the expression 5 r , 



x cos(/ m (i ) + uj m + jn m T p )), (8.98) 
a p cos I p has been used, and where equa- 



tion (8.97) represents the general solution and equation (8.98) represents 



the simplified solution. In addition, the higher order approximation to At 
for the case of circular coplanar orbits can also be modified to give the 
associated expression for inclined orbits. 



8.5.1.2 Case where v m /vt r < 0.66 

Again, expressions for At can be investigated for larger values of B by 
looking at the expression correct to second order in B. Replacing R s with 
(R 2 — ^min) 1 / 2 in equation (8.73), the equivalent equation for the case of 
circular coplanar orbits, gives 



At 



A m Mp 



ApM m a Tl 



ApmMpm 



cos 



n m \jR 2 s - (a p cosIp) 2 



Vtr \ V tr 

X COS (f m (t ) + Urn + jn m T p ) 



ApMl + A m M 2 



— cos 2 



n r . 



A M 2 



\J R 2 S — {dp COS I p ) 2 



Vtr V tr \ V tr 

x sin (2 (fm(to) + w m + jn m T p )) 

\/R 2 — (a P cos I p ) 2 



1 A m A„ a m a m . i 
+ - — sin I 2 



4 A 2 m V tr Rs 



Vtr 



x sin (2(/ m (to) + 0J m + jn m T p )) . (8.99) 



Again, to determine the effect of truncating the expansion at first or- 



der in B, equations (8.97) and (8.99) were compared with the value of At 
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(a) a m =2R a , v m /v tr =0.66. (b) a m =R s , v m /v tr =0.G6. (c) a m =0.5R s , « m /w tr =0.66 



-0.0(15 



(d) a m =27? s , v m /v t r=0A9. (e) a m =R s , v m /v tr =QAQ. (f) a m =Q.5R s , v m /v tr =0A9. 



(g) a m =2R s , Vra/vtr—0.33. (h) a m —R a , v m /v tr —0.3S. (i) a m =0.5i? s , w m /ut r =0.33. 



(j) a m =2R s , v m /v tr =0.16. 
Figure 8.13: Figure of the 

$min — 0.5-/? s . 



(k) a m =R 3 , v m /v t r=0-16. 

same form as figure 8 J 



(1) a m =0.5i? s , v m /v tr =0-l6. 
but calculated assuming 
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calculated from a simulated light curve. These results are presented in fig- 
13) As was the case for circular and coplanar orbits, the values of At 



ure 



given by equations (8.97) and (8.99) qualitatively agree with the numerically 
calculated values, even for extreme values of v m /vt r - Now that the form of 
At has been derived for the case of slightly inclined planetary orbits we can 
explore the effect of inclination on the properties of the signal. 



8.5.2 Effect of inclination on At 



Inclination of the planetary orbit has two main effects on At. First, the 
amplitude of the signal is slightly increased. Second, it results in an increase 
in the accuracy of At through an increase in the accuracy of approximation 
of uniform velocities during transit. These two effects will be discussed in 
turn. 



As can be seen by comparing equation (8.70), the expression for At for 



the case of circular coplanar orbits with equation (8.97), the expression for 



At for the case of slightly inclined orbits, the only term which is modified is 



the cosine term in the amplitude. As discussed in section 8.4.2.1 this term is 
likely to be approximately unity as the the argument of the cosine function 
is likely to be small. In addition, as (R 2 S — 5^ nin ) 1 ^ 2 < R s by definition, 
any inclination of the planetary orbit will act to reduce the argument of the 
cosine term further and consequently it the cosine term will tend to increase 
toward one. As a result, inclination of the planetary orbit causes a slight 
increase in the amplitude of At. 

In addition to modifying the amplitude of At, changing the inclination 
of the planetary orbit also increases the accuracy of our approximation to 
At by increasing the accuracy of the assumption that the planet and moon 
travel with uniform velocities during transit. Physically this is a result of 
the reduced transit duration (and thus the shorter amount of time in which 
the planet or moon has to accelerate). Consequently for the case of inclined 
orbits, the expression derived for At is likely to be more accurate than 
the equivalent expression for circular coplanar orbits (in particular compare 
figure 8.8(c)| and % ure 8.13(c) ). 



Now that the form and properties of At for the case of inclined orbits 
have been investigated, we will look at our third and final special case, the 
case of eccentric planetary orbits. 
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Moon Orbit 




Figure 8.14: Schematic diagram of the same form as figure 8.4 of the coor- 
dinate system for the case of an eccentric planet orbit aligned to the line-of- 
sight. In particular, it is assumed that I p = vr/2, I m = tt/2 and Vt m = n, r 



8.6 Eccentric planet orbit aligned to the line-of- 
sight 

Eccentricity in the planet's orbit can also alter the behaviour of At. To 
isolate the effect of non-zero eccentricity in the planet's orbit, it was assumed 
that the orbits of the planet and the moon were coplanar, and aligned to 
the line-of-sight, that is, I p = tt/2, I m = tt/2 and ft p = Cl m , and that the 
moon's orbit was circular, that is, r m = a m and f m = n m t + / m (0). 

Again, the dependence of At on the orbital parameters of the planet 
and moon can be determined from an examination of the time of ingress 
and egress of the planet and moon. Rewriting equations 
under these assumptions gives 



.15) and (8.16), 



x p = r p cos(f p + uj p ) - 
x m = r p cos(f p + Ld p ) + 



M p + M m 

M m 
M p + M m 



a p cos(n m t + / m (0) +w m ), (8.100) 
a p cos(n m t + / m (0) +u) m ), (8.101) 



where r p is defined by equation (8.9) and f p is defined by equations (8.11 )and 



.13). We can again Taylor expand the first terms in equations (8.100) and 
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(8.101) about the the time that the mid-time of the transit would have 
occurred had there been no moon, giving 



F(e p ,uj p 



(t-(jT p + to)) 



M„ + M r 



-a p cos(n m t + f m (0) + u> r , 



1.102) 



x m= F( ; *d t-(jT p + to)) 
t (ep,LOp) 



+ 



M v + M r 



-a p cos(n m t + / m (0) + u) r , 



1.103) 



where we recall that the transit velocity for a circular orbit is given by n p a p , 
and that for the case of eccentric orbits, the transit velocity is modified by 



a factor F , given by equation (7.40), and restated below for convenience 



F(e p ,u) p 



(1 + e p sinw) 



cos 2 (tan 1 



i+ep tan U ~ 2 j 



cos- 



U 2 ) 



e p (smojp + e p ) 
1 - e 2 (l + e p smu!p) 



Now, equations (8.102) and (8.103) are mathematically equivalent to 



equations (8.39) and (8.40), the equations describing circular coplanar or- 



bits. In addition, as the x-coordinates describing the start and end of the 
transit (x = ±i? s ) are also the same, the expressions derived for At for the 
case of circular coplanar orbits can be directly modified to give At for the 
case of eccentric outer orbits, by replacing i> ir with n p a p /F(e p ,ujp). 



8.6.1 Form of At 



Using this method of substituting n p a p / 'F(e p , oj p ) for vt r into equations ( 8.70 ) , 
(8.72) and (8.73), expressions for At for the case of eccentric planet orbits 
can be derived. Again, these expressions will be investigated for the case 
where v m /vt r <C 1 and the case where v m /v tr < 0.66. 
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8.6.1.1 Case where v m /vt r "C 1 

For the case where v m /vt r <C 1, this substitution results in 

A m M p -A p M m a m ( n m R s F(e p ,ujp) 

At = t 2 - F{e p ,ujp) cos 1 

ApmMpm o, p n p 



X COS (fm,(t ) + L0 m + n m jT p ) , (8.104) 



and 



A m M p a m ( n m R s F(e p ,u p )\ 
At w -j — —^F{ep,ujp) cos 



x cos(/ m (i ) + u m + jn m Tp)), (8.105) 
where the first term in the quotient has again been neglected. 

8.6.1.2 Case where v m /vt r < 0.66 

Similarly, for the case where v m /vt r < 0.66, this substitution results in 
_ A m M p - A p M m a m jp/ f n m R s F(e p ,cv p ) 

iA7~ — ^ y&p^ (~dp j COS I 

ApmMpm a, p n p \ a p n p 

/ p / . \ . . „ n A p M m + A m Mp a m n m a m 

x cos (/ m (t ) + w m + jn m T p ) 



x F(e p , Up) 2 cos / 2 n ^ F ( e P'^) j sin (2 (/m ^ o) + Wm + jn m T p )) 

4 A* m a p n p K s \ a p n p 

x sin (2(/ m (i ) + w m + jn m T p )) . (8.106) 

8.6.2 Effect of eccentricity on Ar 

As the signal in the case where the planet's orbit is eccentric is mathemat- 
ically equivalent to the case for circular coplanar orbits, all the properties 



discussed in section 8.4.2 still hold. However, eccentricity in the planet's 
orbit alters the transit velocity, which means that it alters the amplitude 
of Ar and also alters the point at which the expansion used to derive the 
expressions for Ar breaks down (v m /vt r < 0.66). These two issues will be 
discussed in turn. 
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For the case where the planet's orbit is eccentric, its transit velocity may 
no longer be equal to a p n p . In particular, the value of vt r depends on both 
e p and uj p and can include values from n p a P y/l + e v / A /l — e p at per iastron 



to n p a p ^J\ — e p j yjl + e p at apastron (e.g. Murray & Dermott, 1999 p. 31). 
As the amplitude of At is proportional to ljvtr, eccentricity in the planet's 
orbit can affect At, and does so in three different regimes: 

1. For the case where the planet's orbit is highly eccentric, and the tran- 
sit occurs near pericenter, the transit velocity is increased and the 
amplitude of At is consequently reduced from the equivalent value for 
circular orbits. 

2. For the case where the planet's orbit is not very eccentric, or has 
just the correct orientation, the transit velocity of the planet can be 
approximated to be equal to that for the equivalent planet on a circular 
orbit. In this case, unsurprisingly, At is unchanged. 

3. Finally, for the case where the planet's orbit is highly eccentric, but the 
transit occurs near apocenter, the transit velocity is decreased and the 
amplitude of At is consequently increased from the equivalent value 
for circular orbits. 

While it is possible that a planet on an eccentric orbit could transit near 
apocenter, it is not likely (see appendix [H]) . Consequently, for a majority of 
transiting planets, eccentricity in the planet's orbit will either decrease, or 
leave unchanged the amplitude of At. 

In addition to affecting its amplitude, the value of vt r also determines 
whether or not the expansions used to derive the expressions for At are 
accurate. In particular, for the case where vt r is high (case 1 above) the 
expansions is more accurate, for the case where vt r is approximately equal 
to that for the equivalent circular orbit (case 2 above), the expressions are 
as good as for the circular case and for the case where vt r is low (case 3 
above) the expansions is less accurate. Fortunately, it is more probable that 
a given planet will transit while its velocity is greater than a p n p than for it 
to transit while less than than a p n p (see appendix [H]) , especially for highly 
eccentric orbits. Consequently, for the case of eccentric planetary orbits, the 
expansion used to derive At is likely to be more accurate than for the case 
of a circular orbit. 

For the case of planets with eccentric orbits, eccentricity has the following 
two effects on At. As planets are more likely to transit when their velocities 
are higher, the amplitude of At is likely to be reduced, but the accuracy of 
the analytic expression for At will increase. 
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8.7 Conclusion 



The form and properties of At have been investigated for three cases for 
which the moon's orbit was circular and coplanar with the planet's orbit 
(sections 8.4 8.5 and 8.6), and one case for which it was not (appendix [I]). 
Analytic expressions for At were derived assuming that the velocity at which 
the planet-moon pair orbited each other was much smaller than the velocity 
at which they transited their host star. In addition, a number of properties 
of the signal arising from its form were derived for each of these cases. The 
form of these expressions, and their properties will be discussed in turn. 



8.7.1 The form of At 

For the case where the moon's orbit was circular and coplanar with the 
planet's orbit, the form of At was investigated for planet orbits which were 



circular and aligned to the line-of-sight (section 8.4), circular and slightly 



inclined to the line-of-sight (section 8.5) and eccentric and aligned to the 



line-of-sight (section 8.6). In addition, the case where the moon's orbit 
was slightly eccentric and coplanar with the planet's orbit, for the case 
where the planet's orbit was circular and aligned to the line-of-sight was 
investigated in appendix [TJ From the work in sections 8.4, 8.5 and 8.6, the 
cases where the moon's orbit was circular and coplanar, it can be seen 
(compare equations ( 8.70) ), ( |8.97 ) and (8.104)) that At is given by a function 
of the form 

At = Acos(uj + <p). (8.107) 
For reference the values of the coefficients A, to and <b for the three cases are 



given in table 8.4. To provide physical intuition the amplitudes that would 
have been produced by a Jupiter- Ganymede and Earth-Moon planet-moon 



pair for a range of different orbital configurations are given in table 8.5. As a 



result of its general applicability, equation (8.107) is the equation which will 



be used in the analysis in chapter 10 However, while this is the equation 



that will be used, it should be noted that it will not hold for all cases. 
In particular, as shown in appendix [I] and section 8.4.1.2 non-negligible 
values of e m and v m /vt r can break the symmetry and introduce higher order 
sinusoid components. 



8.7.2 The properties of At 

In addition to its form, the At signal has two main types of properties. First 
it has a characteristic size, parameterised by the amplitude. Second, owing 
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Table 8.5: At amplitudes calculated for a range of values of planetary semi- 
major axis, eccentricity and orbital orientation for the case of an Earth-Moon 
and Jupiter-Ganymede planet-moon pair assuming they transit the central 
chord of a Sun-like star. For reference uj p = ir/2 and ui p = —ir/2 correspond 
to a transit which occurs during periastron and apastron respectively. The 
values for planet-moon pairs which are three-body unstable are crossed out. 



to the way in which r and thus At are measured, only part of this amplitude 
is detectable. This detectable component shows a range of interesting and 
relevant behaviour as a function of moon semi-major axis. These aspects 
will be summarised in turn. 



8.7.2.1 The properties of the amplitude of At 

One way to consider At is to consider the average size of the perturbation 
caused by the moon. As can be seen from equation (8.107), At can be 
approximated by a sinusoid and thus the degree to which a given moon per- 
turbs transit timings from strict periodicity can be estimated by inspecting 
the amplitude. Mathematically, as can be seen from table [8^4} the At am- 
plitude mainly depends on three factors A m /A p , a m and vt r - These variables 
will be discussed in turn. 

First, the amplitude of At is proportional to A m /A p to first order in 
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Vm/vtr- This implies that it is the physical size of the moon and planet 12 
which affects the amplitude of At, not their masses. This is in striking 
comparison with the two other timing methods presented in the literature, 
barycentric transit timing and transit duration variation, which have timing 

1/2 

amplitudes proportional to M m /M p and M m /M p respectively. 

Second, the moon's semi-major axis also affects the amplitude of At. 
In particular, At oc a m , such that moons which are distant from their host 
planet will have larger At amplitudes than moons which are close to their 



host planet. Recall from section 4.3.2 that the size of the signal for direct 



detection does not depend on a m , while the timing amplitude for the case of 
barycentric transit timing and transit duration variation is proportional to 

— 1/2 

a m and a m respectively. These differences again indicate that there may 
be regions of parameter space which the TTV P method could be optimised 
to probe. 

Third and finally, the amplitude of the At signal depends on the transit 
velocity of the planet. As discussed during this chapter, the transit velocity 
of the planet depends on the semi-major axis, eccentricity and orientation of 
the planet's orbit, aspects which will be discussed in turn. The semi-major 
axis determines the size of the orbit as well as the velocity of the planet along 
that orbit. In particular, distant planets will have low transit velocities, 
long transit durations and large At amplitudes, while closer planets will 
have high transit velocities, short transit durations and low At amplitudes. 
In addition to depending on the size of the orbit, the transit velocity also 
depends on the eccentricity and orientation of the orbit. While it is possible 
to have transit velocities ranging from n p a p ^l + e p / ^/l — e p at perihelion 
to ripdpy/l — e p j yj\ + e p , (recall that for a circular orbit vt r = a p n p ), it is 
more likely (see appendix [H]) that a planet on an eccentric orbit will transit 
while its velocity is higher. Consequently planets on eccentric orbits are 
likely to have smaller At values than those for planets on a circular orbit 
with the same semi-major axis. 

To summarise, the amplitude of At depends on the physical size of the 
planet and moon, the semi-major axis of the moon and on the semi-major 
axis, eccentricity and orientation of the planet's orbit, through the transit 
velocity. However, the detectability of a given moon is not just determined 
by the amplitude of At, it also depends on the amount of this amplitude 
which can practically be measured, which, in turn depends on the properties 



12 As will be shown in the next chapter, the amplitude of tj, the timing noise, is pro- 
portional to (As/ A p ) x ((R 3 + am)/ Rs)- Consequently, the detectability of a given moon, 
determined by comparing the sizes of At and e-j, will not depend on A p , only A m . 
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of At. 



8.7.3 The properties of the signal At 

In particular to gain a full understanding of At we not only need an under- 
standing of its amplitude, but also on the proportion of its amplitude which 
can be detected, which, is defined by how At is measured. As discussed in 



section 8.4.2, At is measured as a perturbation to a linear signal and can 
only be measured once per orbit. These issues will be discussed in turn. 
First, consider the time that is measured, tj. From the definition of T/, 

Tj = t + jT p + At + ej 

it can be seen that it is comprised of three terms, a linear term (to + jT p ), 
At, a sinusoidal term and a noise term, ej. However, for the case where 
the moon orbits an integer number of times around its planet per planetary 
orbit or a near integer number of times, At can be constant or approximately 
linear. As a result, for these cases, the perturbation due to the moon will be 
included in the linear fit, and the moon will be effectively undetectable. As 
a result, when the detectable portion of At is plotted against moon semi- 
major axis, a moon which fulfills the above criterion will reside in a non- 
detection spike with width proportional to N~ 1 a^, where N is the number 
of transits recorded, and where the proportionality constant depends on (j>. 

Second, At can only be measured once per orbit, in particular only 
during transit. As a result, it is the configuration of the planet and moon 
during transit which determines At. Consequently the form of the TTV P 
signal of a moon which orbits its planet, for example, n + 1/4 times per 
planetary period (i.e the moon advances around its orbit by 1/4 x 2tt radians 
from one transit to the next) will be the same as a moon which orbits 
its planet (n + 1) + 1/4 times per planetary period (i.e. the moon also 
advances around its orbit by 1/4 x 2ir radians from one transit to the next). 
Consequently, when plotted against moon semi-major axis, moon detection 
thresholds should contain repeating blocks which look the same. 

8.7.4 Conclusion 

The TTVp signal was investigated by deriving expressions and and exploring 
properties of At for a range of realistic planet-moon configurations. In 
particular, At was investigated for three cases where the moon orbit was 
circular and coplanar with the planet orbit, specifically, where the planet's 



orbit was circular and aligned to the line-of-sight (section 8.4), circular and 
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slightly inclined to the line-of-sight (section 8.5) and eccentric and aligned to 
the line-of-sight (section |8.6[ ). In addition, to determine the effect of altering 
properties of the moon's orbit, the case where the planet's orbit was circular 
and aligned to the line-of-sight, but the moon's orbit was eccentric was also 
investigated (appendix [I]). It was discovered for the case where the moon's 
orbit was circular and coplanar with the planet's orbit that At could be 
approximated by a sinusoid, and that high values of v m /vt r or eccentricity 
in the moon's orbit introduce higher order terms (see section 8.4.1.2 and 
appendix [I]) . In addition, At has a number of properties as a result of the 
fact that it can only be measured once per transit, and is measured in the 
context of a linear trend (to+jT p ). In particular it was found that for moons 
which orbit their host planet an integer number of times per planetary orbit 
are not detectable, as for this case At becomes a linear function of transit 
number and cannot be independently determined using the fit. Armed with 
these expressions and properties, only information about e,-, the timing noise, 
is needed to derive and interpret detection thresholds. With this in mind, 
the timing noise €j will be investigated in the following chapter. 
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Chapter 9 



Measurement error in TTV p 



signal 



9.1 Introduction 

Whether the TTV P signal of a given moon can be detected depends not only 
on At, the timing signal produced by the moon, but also on the character- 
istic size and behaviour of ej, the timing noise masking this signal. This 
timing noise is a result of the perturbations on r caused by the sequence of 
a n values for a given transit, where a n is the sum of all sources of photo- 



metric error for a particular exposure. As shown in section 7.4 this error, 



to first order in £\ a n / (A p + A m ), is given by equation (7.60) 

*3 = A T A & " U T P + t ° + Ar )l a ^i)- 

The photometric error in a light curve, a n , can come from many sources, 
including detection error, propagation effects, shot noise (error due to small 
number statistics) and variability in the signal source, in this case the host 
star. As the behaivour of ej is dictated by the behavour of a n through 



equation (7.60), ej will be investigated within the context of three realistic 
noise sources. First, the distribution of ej is determined analytically assum- 
ing all noise sources produced white noise. This case is selected as white 
noise is easy to theoretically manipulate, as shot noise and common types 
of instrumental noise such as dark noise and read noise are well described 
by white noise, and as initial investigations into TTV P detection, assuming 



white noise, are present in the literature (e.g. |Szab6 et al. 2006). Second, 



the distribution of ej is numerically investigated for the case of realistic 



193 




time 



Figure 9.1: An example transit showing measured intensities (dots) and the- 
oretical intensity (line). The quantity ct(U) is shown for one of the exposures 
and the region which is not counted within the sum is shaded grey. 



stellar photometric noise. This case is selected as intrinsic photometric vari- 
ability of stars is not a white noise process, and it has long been known 
that red photometric noise due to stellar variability, is a limiting factor in 
transit surveys (e.g. Borucki et al. 1985). While this type of noise cannot 



be completely avoided, methods have been proposed in the transit detection 
literature to reduce its effects on planet detection. Most of these involve 
some form of filtering. Consequently, the distribution of ej is also deter- 
mined using realistic stellar noise which had been "filtered" for trends due 



to the rotational modulation of starspots using the method of Lanza et al. 
( |2003l ). 

Before the distribution and behaviour of €j can be investigated for these 
three cases, appropriate methods for determining €j using equation (7.60) 
need to be selected. Consequently, the properties of equation (7.60) will 
be explored with the aim of selecting the optimal method for determining 
€j analytically, for the case of white noise, and numerically, for the case of 
realistic and filtered realistic stellar photometric noise. 



9.2 Method 



Before investigating the effect of white, realistic and filtered stellar pho- 
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tometric noise on ej, two issues must be addressed. First, the method for 
determining the properties of €j for a given sequence of a n needs to be 
discussed. Second, for the cases of realistic and filtered photometric noise, 
photometric noise from real stars is required. Consequently, the issue of 
whether solar photometric data is representative of the photometric noise 
from a typical star will be discussed. 



9.2.1 Method for calculating ej 



We begin our investigation with equation (7.60), the equation which links 



€j and a n , restated below for convenience 



Ap + A n 



^ [U -t - jT p - At] a n (ti) 



As described in section 7.4.4, the sum is evaluated over a time interval 
of length T b s centered on the planetary transit (see figure 9.1). While 



equation (7.60) is exact, it is not in the most useful form. In this chapter, 



an approximation to equation (7.60) will be used, mainly as it is physically 



realistic, and the simplified version has a suite of properties which can be 
exploited. 

To begin, we consider the relative sizes of the terms in square brackets. 



First, ti is the time at which the i exposure is taken in a given transit, 
while to + jTp is approximately the mid-transit time for that transit. Con- 
sequently the quantity ti — to — jT p ranges from — 1/2T (J \ )S to 1/2T & S . As 
the length of the observing window must be at least as long as the transit 



duration (see figure 9.1 ) and transit durations are typically many hours long 
(see table 7.1), ti — to — jT p is of the order of many hours. Conversely, as 



determined in chapter ^1 At is of the order of minutes for physically realistic 



moons (see table 8.5). As equation (7.60) is dominated by large values of 
ti — to — jTp, the At term can be neglected, giving 



1 

Ap + A Tl 



y~] [U - t - jTp] a n (ti 



(9.1) 



This is the equation that will be used to analytically investigate the case of 
white noise. 

For the case of realistic and filtered noise, real photometric light curves 
will be manipulated to give a sample of ej values, which will consequently be 
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used to estimate the distribution and behaviour of ej. 1 To begin, consider 
an explicitly measured light curve L(tj), given by 



L(U) — Lq + L n {ti 



(9.2) 



where Lq is the average value of L(ti), and L n (ti) is the zero- mean photo- 
metric noise. Now, some stars may exhibit more photometric noise than the 
star from which our real light curves were taken, while others may exhibit 
less. To explore the effect that the amplitude of the photometric noise has 
on €j, we will focus on the case where Lq is the same, but the amplitude of 
the photometric noise has been scaled by a factor (3. 2 Let us define L*, the 
luminosity associated with this general light curve, as 



L*(U 

where f3 is a known constant. 



L + pL n (ti), 



(9.3) 



Currently equation (9.1) is formulated in terms of a n (ti). As we will be 
using real light curves to estimate ej, we would like to write equation (9.1) 
in terms of L(ti) as opposed to L*(ti) or a n (t 



To do this, we define 
J2itiP{L + L n {ti)) 



(9.4) 



where we have multiplied the numerator and denominator by (3 to simplify 
the algebra later on. 

As Si PLo S> ^2if3L n , we can expand t* using the same method used 
in section 7.4.51 to expand r. Noting that ^ i f3L n / PLq is our small 



parameter, and performing the binomial expansion of the denominator gives 



1 



UP{Lq + L 




(9.5) 



As an aside, the assumption that out-of-transit light curves can be used to estimate 
in-transit photometric variability is not necessarily valid. For example, models (Silva 
|2003[ ) and observations (e.g. Pont et al. 20071 show that if a planet passes in front of a 
finite structure on the surface of its host star, e.g. a starspot, during transit, this will 
result in additional perturbations to the transit light curve. The effect of this additional 
noise source on At will not be investigated in this thesis. 

2 Simple noise amplitude scaling will not fully describe the behaviour of ej for all stars. 
The particular shape of the noise power spectrum depends on the processes that produced 



the red noise, for example, granulation (Rabello Soares et al. 19971 and rotational mod 



ulation of active regions (Lean et al. 1998) and their associated characteristic timescales. 



While methods do exist (e.g. Aigrain et al. 20041 for predicting red noise characteris 



tics of stars as a function of physical and spectral properties, this extension will not be 
investigated in this thesis. 
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Now, as Lq is constant, ]T\ Lq = N obs Lo, where N obs is the total number 
of exposures in the observing window. Using this expression to simplify 
equation (|9.5l) and only retaining terms up to first order in L n , gives, 



(9.6) 
(9.7) 



T = ~~ tttt ; r 



PN obs L PN obs L 



PNobsLo PN obs L 



N obs 



+ 



PNobsLo 



f3Nob s L N obs ' 



As the exposures are evenly spaced, ^iU/Nohs, the average of the times at 
which the exposures are taken, is equal to t m ia, the time corresponding to 
the center of the window. Consequently, 



T — tmid ~l~ 



1 



(3N obs L 



(9.8) 



We can now compare this equation with equation (9.1), the equation 



describing ej. First, we note, that for the case described by equation (9.1), 
tmid,p is given by to + jT p . In addition, for the case where the luminosity 
of the star is given by equation (9.3), a n (tj) is equal to — /3L n (tj). 3 Using 



these two expressions, we can rewrite equation (9.1) as 



1 



A v + A Ti 



(9.9) 



Substituting equation (9.8) into equation (9.1) gives 



PN obs L 
-\- A m 
PN obs L 
Ap -\- A m 



\T tmid,p) j 



where 



(9.10) 
(9.11) 

(9.12) 



Consequently, the error e,-, recorded for the case where the host star's 



luminosity is given by equation (9.3), is given by equation (9.10), where r 



is defined by equation (9.4) and where t m id :P is the middle of the piece of 



light curve used to determine the effective value of ej. Equations (9.10) and 



(9.4) provide a simple method for determining a sequence of representative 



e,- values for a transit with a given A p + A m , using out of transit light curves. 



3 Recall that a n is defined as a photon deficit, while L n is defined as a photon surplus. 
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9.2.2 Suitability of solar photometric data for calculating tj 



In order to determine the effect of intrinsic stellar photometric variability on 

a suitable sample light curve which is dominated 



ej using equation (9.10) 



by realistic photometric stellar variability corresponding to a typical star is 
required. This data series should optimally have a number of properties. 
First, it needs to be long enough such that a statistically valid estimate of 
€j can be constructed. Second, the data must have high signal to noise, 
to ensure that the majority of recorded photometric noise is inherent to 
the star, and not resulting from instrumental or statistical effects. Third, 
the data needs to be high cadence, such that the effect of long and short 
exposure times on ej can be investigated. Finally, this data must be easily 
available and easy to manipulate. Fortunately, photometric measurements 
of the Sun meet all these requirements. 

Over twelve years of high quality solar data are available as a result of 
the Solar and Heliospheric Observatory (SOHO). This satellite is positioned 
at the LI point between Earth and the Sun, which allows it uninterrupted 
access to the Sun's behaviour. For this work it is the measurements of 
Total Solar Irradiance (TSI), the total intensity of the solar face, that are 



important. This measurement is made within the Virgo module (Frohlich 



et al. 1995), by comparing the output of the photometers PM06V and 
DIARAD (Frohlich et al.[ |1997[). This data was kindly made available by 



SOHO team and can be freely downloaded from the SOHO archive. 4 

While high quality solar data may be freely available, whether it should 
be used depends on whether the photometric behaviour of the Sun is repre- 
sentative of the photometric behaviour of Sun-like stars, in particular, the 
stars to be targeted by COROT and Kepler. Indeed, it has been found that 
the Sun shows two to three times less variation on the decadal time scale 



compared to similar stars (Lockwood et al. 1997; Radick et al. 1998 Lock 



wood et al. 2007). While this discrepancy may seem discouraging, it needs 



to be viewed in the context of two other findings, the effect of star orienta- 
tion with respect to the observer, and the results of studies of extra solar 
planet host stars. First, it has been suggested that the low observed value 
of solar photometric noise is due to our privileged observing position, that 
is, in the plane of the Sun's equator. The maximum predicted magnitude of 



this effect ranges from an increase in photometric variability of 6 (Schatten 



1993) to 1.3 (Knaack et al. , 2001 ) times that observed in the equatorial plane 



as the viewing angle is altered. Fortunately, it has been observed for both 



transiting (Fabrycky <__ Winn, 2009) and non transiting (Le Bouquin et al. 



http://seal.nascom.nasa.gov/cgi-bin/gui_plop. 
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2009) planets that there is a preference for the orbital angular momentum 
vector of the planet and the spin axis of the host star to be aligned. Con- 
sequently, as the planets of interest transit, the equator of the target star 
should also be preferentially aligned with the line-of-sight. Second, while 



the survey conducted by (Lockwood et al. , 2007) into the photometric vari 



ability of Sun-like stars is the most temporally complete, it is not the only 
survey. In particular, the survey of Henry et al. (2000) has looked at the 



photometric variability of extra-solar planet host stars detected by the radial 
velocity technique. Within this subset, the Sun appears typical. Unfortu- 
nately, as only relatively magnetically inactive stars are chosen as targets 
for radial velocity searches, and long term photometric variability increases 
with increasing magnetic activity (e.g Radick et al. , 1998), this set of stars is 



statistically biased. Consequently, the study of Henry et al. ( 2000 ) cannot 



be used to argue that the Sun's photometric stability is representative of 
most stars. However, while not all planetary hosts will display the same 



photometric stability as the Sun e.g. CoRoT-2b (Alonso et al., 2008), this 



study does indicate that there is a substantial subset of planetary host stars 
which will. Within this context it was felt that the solar data was a suitable 
testbed for this analysis. 



For the case of the Sun, equation (9.11) becomes 



where €q is given by 



e 



PNobsLc 



£ 0> 



tmid. 



(9.13) 



(9.14) 



where the luminosities L(ti) are given by the total solar intensity measure- 
ments taken by SOHO. By evaluating cq for enough segments of solar light 
curve, an observed distribution for eq, and correspondingly ej, can be con- 
structed. This formulation will be used to analyse the cases of realistic and 
filtered stellar noise. 

Finally, it should be noted that the specifics of the behaviour of ej of 
course depend on the photometric behaviour of the host star selected and 
will not necessarily share the the same behaviour as derived for the Sun. 
Fortunately, ej can be derived for any given host star using the method 
demonstrated above, but using out of transit light curves as opposed to 
SOHO data. To ensure accurate assessments of moon detectability, this 
procedure would have to be completed for the host star of every transiting 
planet targeted for followup. 
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Now that the method and data have been discussed, we can move on to 
analysing the behaviour of €j for the three noise sources under investigation. 
First we will look at white noise. 



9.3 White noise: analytic derivation 
9.3.1 Introduction to white noise 

White noise is a type of additive noise which occurs in many physical pro- 
cesses such as photon counting. It is known as white, as the Fourier trans- 
form of white noise contains the same power at all frequencies. Using the 
analogy between frequency and colour, as there is no dominant colour, the 
noise is "white". 

In order for a discrete sequence of numbers x = {x±, X2, ■ ■ ■ Xj, . . . , x n } 
to describe white noise, two conditions must be satisfied. First, each of the 
Xj must be drawn from a Gaussian distribution, that is, they are described 
by the probability density function 



where \i and a are the mean and the standard deviation respectively. Second, 
the individual values of x must be uncorrelated, that is, the probability 
distribution for any Xj is unaffected by the values of any of the Xf. where 



Uncorrelated, normally distributed random variables, such as those used 
to produce white noise, have a number of useful properties. In particular, 
if X\ and X2 are uncorrelated normally distributed random variables with 
mean fix and \i2 an d standard deviation u\ and 02, then a linear combination 
of these variables, aX\ + bX2 is also normally distributed with mean /ii+2 
and standard deviation o"i+2 ; given by 




(9.15) 



O/il + bjl2 



(9.16) 




(9.17) 



9.3.2 Derivation of e, 



Restating the definition of ej given by equation (9.1), 



Zip -+- fi m 
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it can be seen that €j is denned in terms of a sum of values a n , drawn from 
a distribution which has first been premultiplied by the value U — (jT p + 
to). As €j is the sum of a set of normally distributed random variables, 
a n , it too is normally distributed. In addition, assuming the distribution 
of a n has mean and standard deviation &l we can use equations (9.16) 



and (9.17) to determine [i e and a e the mean and standard deviation of 
the distribution of €j respectively. Applying equations (9.16) and (9.17) 



iteratively to equation (9.1), we have that 



Me 







and 



Or 



Ap + A v 



lY^{u-{jT p + t )y. 



(9.18) 



(9.19) 



While equation (9.18) is fully evaluated, equation (9.19) is not. In order to 



evaluate this expression, we need to write ti in terms of physical variables 
such as the t m id >p , the midpoint of the window, At, the exposure length, and 
N b s , the number of exposures used to calculate r. Using these variables, ti 
can be defined 

U = t mid>p +(i- ^-j At. (9.20) 

Noting that the window over which the transit is examined is centered 
on the planetary transit, we have that the center of the j th transit will occur 
at 

tmid, P = jT p + 1 + At p , (9.21) 

where At p is the time delay in the center of the planetary transit due to 
the motion of the planet about the planet moon barycenter (i.e. the TTV& 
signal). Consequently t{ is given by 



ti = jTp + t + At p + 



(9.22) 



Now, as was shown in section 8.4.1.1 we know that the TTV;, signal is less 
than, or at the very least the same order of magnitude as the equivalent 
TTVp signal (At). As a result, as we have neglected At term from equa- 
tion (7.60), as it was much smaller than t{ — (jT p + to), we can also neglect 



the At„ term from equation (9.22) for the same reason 



Thus, writing out tj and the sum limits in full, we have that 



0"L 



Ap + A m y 



N obs -l 

E 

i=Q 



N obs 



At 2 . 



(9.23) 
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Expanding the term under the square root 



A„ + A rt 



and using the identities 



N ob s-l 



s Y, &t^-N obs AtH + ^Ati, 

\ i=0 



(9.24) 



i=0 



n(n + 1) 



i=0 



. 2 n(n +l)(2n+l) 



6 



(9.25) 
(9.26) 



equation (9.24) can be expanded to give 



Or 



Oh 
An + A r 




(9.27) 



In addition, as the first term is approximately N\ times the second 
term, where N obs , the number of exposures used to calculate r, is a large 
number, the second term can be neglected giving 



0-6 = L 



N obs At N obs 



12 



An + A Tl 



(9.28) 



To provide intuitive understanding of this expression, equation (9.28) was 
recast into physical variables, 5 giving 



a f = 47.9s 



&l/Lq 
3.95 x 10" 4 



At \ 1 2 
lmin 



100 (A p + A m ) 

NtraLo 



( T obs y/ 2 / T tra \ 

V24hr S y ViShrs/ 



\ -l 



. (9.29) 



To check this formula, a Monte Carlo simulation was run. A mock transit 
light curve was constructed using three straight line segments corresponding 

5 In particular the reference system is a Jupiter-like planet about a Sun-like star at an 
orbit of 1 AU, while the reference instrument is Kepler. The reference relative photometric 
accuracy was calculated by assuming that the nominal relative photometric pr e cision of 
2 x 10~ 5 in a 6.5 hour exposure for a magnitude 12 star (e.g. 



Borucki et al. 



2003| ) is 

dominated by shot noise, and consequently calculating the corresponding shot noise that 
would be observed in a one minute exposure. 
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Figure 9.2: Comparison between a Monte Carlo simulation (white bar) and 
theoretical prediction (black line) for the distribution of first moments for 
transit contaminated with white noise. 



to the ingress, flat bottom and egress. To this, Gaussian noise with a known 
standard deviation was added to produce a sequence of model transit light 
curves. The €j corresponding to each of these light curves was then calcu- 
lated. The histogram of €j along with the theoretical prediction are both 



shown in figure 9.2 As can be seen, the agreement is very strong. 



9.3.3 Properties of e. 



As can be seen from equation (9.28), the standard deviation of e,- for the 



case of white photometric noise depends on a number of physical quantities, 
in particular, the photometric accuracy, ox, the exposure time, At, the 
size of the planet (parameterised by A p ) and the length of the transit and 
observing windows. As a different physical process is highlighted in each of 



the three terms of equation (9.29), this equation will be used as a scaffold for 
this discussion. These three terms will be discussed in turn, in the context 
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of equation (9.29) and in the context of previous results assuming white 



photometric noise, in particular, those from Szabo et al. (2006). In addition 
€j will also be discussed in terms of two physically important quantities that 
it does not depend on, At and j, mainly as this independence leads to a 



number of important statistical properties that will be used in chapter 10 
to construct thresholds. To begin this exploration of the properties of ej, 
we look at the dependance of a e on the relative photometric accuracy per 
exposure and the exposure time. 



The first term in equation (9.29) describes both the dependence of a € 
on the relative photometric accuracy (ai/Lo) and the exposure time (At). 
Unsurprisingly, the smaller the relative photometric error, the smaller the 
value of a e and consequently, the smaller the error in r. However, while 
the relative photometric noise can be altered without changing the expo- 
sure time, for example, by moving to a larger telescope or by upgrading 
the instrumentation, changing the exposure time can alter the photometric 
noise depending on the source of the photometric noise. For the case of shot 
noise, error due to small number statistics, the total error is proportional 
to the square root of the total number of photons collected for that star 
(i.e. oc \^At) while the total intensity is proportional to the total number 
of photons (i.e. oc At), so the relative photometric error is proportional 
to ^fAt/At = Ai -1 / 2 . 6 For the case of read noise, error resulting from 
the process of measuring the number of photons collected at the end of an 
exposure, the total error does not depend on the exposure time. Conse- 
quently, the relative photometric error is proportional to (At) -1 . So, for 
a star with white photometric noise that is dominated by shot noise, a e is 
independent of exposure time, while for a star dominated by read noise, a t 
should decrease with increasing exposure time. 7 



The second bracketed term in equation (9.29) represents the effect of 
planet size on the size of a e . This can be seen by noting that L^Ntra is the 
total number of photons emitted by a star during a transit duration, while 
A p + A m is the number of photons blocked by the planet during the same 
time. Consequently, this fraction can be thought of as a ratio between the 
projected area of the planet-moon pair and the star on the sky respectively. 
While this initially seems to indicate that moons should be more detectable 
around large transiting planets as a result of their large A p +A m values, recall 



6 A similar argument can be constructed for the case of dark noise, the noise caused by 
electron motion in the CCD chip during an exposure. 

7 For the case where the relative photometric noise is independent of exposure time, 



such as the case explored by Szabo et al. ( 2006 1, cr E will decrease with decreasing exposure 



time, which is in agreement with their findings. 
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from section 8.4.2.1 that the amplitude of At also decreases as {A p + A r 



As both these terms can be thought of as representing the cross-sectional 
area of the planet, the gain in moon detectability caused by decreasing a e 
is matched by the loss in moon detectability caused by the decreasing At 
amplitude. 

The final term in square brackets describes the dependance of a e on the 
transit duration and the length of the observing window. Physically, this 
corresponds to investigating the effect on a e of the distance of the planet- 
moon pair from the star, and the distance of the planet-moon pair from each 
other. For a given planet-moon system, the optimal length of the observing 
window will scale with the transit duration. Consequently a e oc v^tra- 
This result, along with the dependance of At on transit duration, 8 implies 
that for the case of white photometric noise, the detectability of a given 
moon increases with increasing transit duration of its host planet. This is 



in agreement with the work of Szabo et al. (2006) who found that more 



distant planets (with larger transit durations) had more detectable moons. 
The relationship between a e and T b s (and consequently the planet-moon 
separation) is a little more involved as the position of the moon is not known 



a f increases as 



r 3/2 
l obs ■ 



before detection. From equation (9.29) it can be seen that as T a b s increases, 



Consequently it would be useful to use the smallest 
observing window possible, while still including the moon's transit. Consider 
the best case scenario, where the observing window selected is the shortest 
window such that the full transit of the moon will always be captured, 
that is, T b s = 2a m /v t r + T tra - Consequently for a small semi-major axis 
(2a m /v tr <C Ttra), &e will be independent of a m and for large semi- major axis 
(2a m /vtr 3> Ttra) is proportional to am ■ Comparing this to the result from 
section 8.4.2.1, that the amplitude of At is proportional to a m , it can be 



seen that very close and very distant moons are not particularly detectable. 



This is at odds with the result found by Szabo et al. (2006) in that they 



proposed that only close moons were undetectable. 

Now that the physical parameters that affect a e have been discussed, 
a e will also be discussed in the context of two additional parameters, At 
and the transit number j. In particular the fact that a 6 does not strongly 
depend on either At or j will be of great use in constructing thresholds in 



chapter 10 We begin with an investigation of the relationship between a e 
and At. 



The first parameter of interest is At. As can be seen from equation ( 9.29 ) , 



Recall from section 



8.4.2.1 



that the amplitude of At is inversely proportional to Vt r 



and thus proportional to Tt, 
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a t does not depend on the measured value of At for that transit. This 
lack of dependance is partially a result of neglecting the At term in equa- 



tion (7.60) (as it was small) and partially as a result of neglecting higher 



order terms in the expansion of equation (7.51) required for the derivation 



of equation (7.60) (see section 9.2.1 and appendix UJ) . While a e does not 



formally depend on At, it does depend on a quantity that changes as At 
changes, (A p + A m ). However, for the purposes of determining the detection 
threshold it acts as a constant, 9 and so can be ignored. This weak depen- 
dence of €j on At means that the noise in t can be approximated with little 
to no knowledge of the orbital elements of any putative moons. This prop- 
erty will simplify the mathematics in the next chapter, when the expressions 
describing At and €j are combined to determine the set of detectable moons. 

The second parameter of interest is the transit number, j. To under- 
stand the (lack of) dependance of ej on the transit number j, consider the 
following. Prom the definition of white noise, the error in each exposure is 
uncorrelated to the error in any other exposure. Consequently, the error 
in any quantity calculated from a sequence of exposures (such as t) should 
be uncorrelated with the error in that quantity calculated from a different 
sequence of exposures. That is, for the case of white noise, ej is independent 
of the transit number j. The fact that the errors in r for consecutive transits 
are uncorrelated with each other is another very useful statistical property 
in terms of fitting the sequence of transits, and consequently, calculating 
detection thresholds. 

Now that we have explored and discussed the properties of ej within 
the context of white noise, we have an understanding of the types of be- 
haviour that photometric noise can have on r. We will use this as a basis 
for investigating the effect of more complex types of noise on the error in t, 
and in particular, move onto the case where realistic (solar-like) noise is the 
dominant noise source. 



The statistical method that will be used in chapter 10 uses ratios between the signal 
and the noise to construct thresholds and will only be evaluated to first order in v m /v t r- 
As these ratios will only be evaluated to first order in v m /v t r, and At is already first order 
in Vm/vtr, only the zeroth order component of A p + A m need be retained i.e. A v + A m — 
Ap + A m - The disappearance of these first order terms is heartening as they depend on the 
orbital parameters of the system in a non-trivial way. This issue will be further discussed 
in sections 10.4 and 10.6 
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Figure 9.3: Comparison between white noise (top panel) and red noise (bot- 
tom panel). 



9.4 Red noise: observational derivation 



9.4.1 Introduction to red noise 



Red noise is additive noise which contains an excess of power in lower (or 
"redder" ) frequencies. While in signal processing red noise is a slang term for 
brownian noise or random walk noise and has spectral density proportional 
to f~ 2 , where / is the frequency, in astrophysics, the term is colloquially used 
to describe any noise process which produces an excess of low frequencies. 
For a more in-depth introduction into correlated noise processes within the 



context of transits, please see Carter Sz Winn (2009). 



The fact that stars in general (e.g. 
1982) and the Sun in particular (e.g. 



component in their light curves has 



Radick et al. 



1982 



Dorren & Guinan 



Willson et al. 1981) have a red noise 



ong been known. Due to this excess 



of long wavelength components, consecutive data points in a light curve 
dominated by the intrinsic variability of the host star are correlated, that 



207 



is, there are long term trends in the data (see figure 9.3). Consequently, 



the statistical methods used in the previous section no longer apply. As 



discussed in section 9.2.2, for the case of realistic photometric noise, the 
distribution and behaviour of €j will be calculated numerically using solar 
photometric light curves. 

9.4.2 Derivation of €j 

For this thesis, Total Solar Intensity (TSI) data, taken using the the DI- 
ARAD instrument on the SOHO satellite was used. In particular, DIARAD 
TSI data taken between 30/1/96 to 1/9/07 was downloaded from the SOHO 
website. 10 This data was in the form of a FITS compliant text file (see fig- 
ure 9.4). To indicate a NULL reading, the intensity 99.99 and the flag 99 
were used. There were a number of issues with this data. First some of 
the data was not available for certain dates (presumably as the photometer 
was not running). Second, during days where data was available, it was not 
available for all times. Finally, numerous formatting inconsistencies, such 
as the running together of the last intensity value and the NULL flag and 
the value of the recorded intensities reversing sign, meant that preprocessing 
was required. 

To deal with the issue of missing days and formatting inconsistencies, a 
Perl program was written to reformat the data. This program: 

• Read in all the data from the data files. 

• Inserted the correct number of NULL values (99.99 and 99) for each 
missing day. 

• Separated the intensity value and the 99 from the end of each measured 
sequence. 

• Checked the status flag for each exposure, and if the code was not 
valid (7^ 16380) then set the intensity and the intensity flag to 99.99 
and 99 respectively. 

• Reversed the sign of the intensity if it was negative. 

• Corrected the intensity for instrumental variation using the table given 
on the SOHO website. 11 

10 http://seal. nascom.nasa.gov/cgi-bin/gui_plop. 

11 ftp://ftp.pmodwrc.ch/pub/data/irradiance/virgo/TSI/korr_tot6_002_0904.dat. 
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DIARAD DIARAD PM0V6V PM0V6V 



T bs 


3 mins 


30 mins 


1 min 


30 mins 


30 mins 


163039 




132185 




1 hr 


78971 


76956 


63802 


61681 


2 hr 


37208 


36244 


29736 


29604 


4 hr 


16503 


16113 


12657 


12153 


8 hr 


6471 


6318 


4077 


3679 


12 hr 


3438 


3365 


475 


462 


16 hr 


2080 


2038 


238 


228 


24 hr 


824 


805 


73 


71 


36 hr 


317 


307 


54 


53 



Table 9.1: Number of complete lengths of data of a given duration, T b s , 
for each of the four possible data sets. 



• Output the intensity of each exposure and the time in minutes since 
the beginning of observation, in this case, since 30/1/1996, into a single 
text file. 

Now that a corrected TSI time series is available the set of variable values 
for which the distribution of €j will be evaluated needs to be considered. 

As this is a numerical investigation into ej the set of variable values for 
which the distribution of e,- will be evaluated needs to be selected. Informed 
by the work on white noise, physically important variables include T b s , Tt ra , 
At, j3 and A p +A m . However, as the terms /3 and A p +A m (and consequently 



T tra ) only appear in the premultiplied factor in equation (9.13), not in sq 
we only need to consider a grid of T b s and At values. The grid of values 
selected for this analysis will be discussed. 

An investigation of the effect of the size of the observation window into 
the distribution of ej is pertinent for two main reasons. First, to document 
the effect of the size of the observing window for cases relevant to the set of 
planets likely to be discovered by COROT and Kepler. Second, to check the 
method by comparing the behaviour of €j with the behaviour calculated for 
the case of white noise. First, the range of observing windows relevant to the 
COROT and Kepler missions needs to be determined. As the observation 
window cannot be smaller than the transit duration, the minimum obser- 
vation window is limited by the minimum transit duration. As discussed 
in chapter [3j it is unlikely that moons will be discovered around planets 
with semi-major axes smaller than 0.2AU as a result of moon orbital evo- 
lution and subsequent loss. In addition, as a result of the finite duration of 
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the COROT and Kepler missions, and thus the limited number of transits 
which can be observed per planet, it is unlikely that moons will be discov- 
ered around planets with semi-major axes larger than 1AU. For a Sun-like 
star, these two limits correspond to a transit duration of 8 hours and 12 
hours respectively, for the case where the planet passes across the central 
chord of the star. The actual transit duration may be shorter depending on 
the inclination of the planet's orbit to the line of sight. As 86.6% percent 
of transiting planets will have transit durations longer than 50 percent of 
the nominal duration, a conservative estimate of the minimum length ob- 



servation window is four hours. Also, as will be discussed in section 10.4 



the most detectable moons are located between 1/2R S and 2R S from the 
host planet, the exact value depending on the behaviour of the noise on the 
light curve. For the case of planets at 0.2AU and 1 AU, this corresponds to 
an observing window of length 16 and 24 hours respectively, for a central 
transit. As we would not only like to search for the most detectable moons, 
it was decided to investigate observing windows with lengths between 4 and 
36 hours. Second, in order to compare the results of this analysis with that 
of the analysis of white noise, we need to select the set of observing windows 
for which the data most resembles white noise. The shorter the observing 
window, the smaller the effects of the long term trends caused by the red 
noise component, the more "white", the noise should be. Consequently it 
was decided to also investigate observing windows which were shorter than 
four hours. As the largest exposure time used by COROT and Kepler is 30 
minutes it was decided that this would be the shortest observing window 
investigated. As a result of these two factors, it was decided to investigate 
the cases where the observing window was of length 30 minutes, 1 hour, 2 
hours, 4 hours, 8 hours, 12 hours, 16 hours, 24 hours and 36 hours. 

An investigation of the relationship between exposure time and the dis- 
tribution of €j may allow optimisation of the selected observing strategy 
with respect to moon detection. As the satellite missions COROT and Ke- 
pler are the most likely to find the types of planets which could harbour 
moons and also have the required sensitivity to do the TTV P followup, it 
is most useful to investigate the effect of exposure time on ej with respect 
to the capabilities of these satellites. Both these satellites are capable of 
long and short exposures. For the case of COROT, a 512 second exposure 
time is used for its catalogue of approximately 12000 targets, but it is also 



capable of 32 s readout for a subset of 1000 highlighted sources (Quentin 



et al. 2006). For the case of Kepler, a thirty minute exposure time is used 



for its set of 3000 sources, but it is also capable of one minute exposures 
for a subset of 512 highlighted sources (Borucki et al. 2008). So, ideally, 
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it would be useful to compare the distribution of €j for the case of "short" 
exposures (32 seconds or 1 minute) and "long" exposures (16 or 30 minutes). 
Unfortunately, PM06V and DIARAD, the two instruments on the SOHO 
satellite, have exposure times of 1 minute and 3 minutes respectively, so, 
solar data with cadence below 1 minute are not available. In addition, while 
data with cadence of 1 minute are available, it is quite patchy. As a result 
of this patchiness, there are only a small number of long contiguous lengths 



of data which are 24 or 36 hours long (see table 9.1). As we would like to 
investigate the case of short exposure times for long observing windows, it 
was decided to use the 3 minute cadence DIARAD data. To maximise the 
difference between long and short exposures it was decided to investigate the 
case of 30 minute exposures. These exposures were created by first divid- 
ing the three minute data into thirty minute intervals. For the case where 
all the data within a given interval was valid, the average of the intensities 
of the constituent three minute exposures was recorded as the intensity for 
that 30 minute "exposure" . For the case where some of the data within the 
interval was not valid, a NULL intensity was recorded for that 30 minute 
"exposure" . 

To implement this numerical investigation into €j for this grid of T b s 
and At values, a second program was written. It read in either the 3 minute 
or 30 minute corrected TSI values and divided each section into "transits" 
of a given duration T b s , ignoring any remainder. As discussed above, the 
distribution of ej can be calculated from the TSI directly and then scaled 
by /3LoN bs/(A p + A m ) for the transit in question. Thus, for each of these 
lengths, the error associated with each of these transits were calculated and 



then binned to give an experimental distribution of €q (see figure 9.5). As 
these distributions were approximately Gaussian for all T Q b s examined it 
was decided to model the distribution using the standard deviation. Using 
the MATLAB function fminsearch to perform a least squares fit to the 
histogram, /xq and <tq, and the errors associated with these values were 
derived. In all cases the mean was approximately zero, consequently only 
ctq was recorded. 



The results are presented in figure 9.6 and table 9.2 As can be seen 



in table 9.2 the values of <7q corresponding to the two different exposure 
times for T b s greater than 4 hours agree within error bounds. As the vast 
majority of realistic values of T b s are likely to be greater than four hours, 
these two data sets are effectively equal. Performing a quadratic fit to this 
data, the following formula for the standard deviation of cjq is obtained 



CT 



5.32 x 10~ 3 s + 6.48 x 10~ 3 s hr^T^ + 6.96 x 10~ 4 s hr~ 2 T 2 bs (9.30) 
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SIMPLE = 
BITPIX = 
NAXIS = 



T 

8 




/ file does conform to FITS standard 
/ number of bits per data pixel 
/ number of data axes 



EXTNAME = 'DIARAD LEVEL V 
TNULL1 = '99.99' 



TNULL2 
END 
99.99 
99 

99.99 
99 

1366.747 
16380 



= '99' 
99 

99.99 
99 

99.99 
16380 



99.99 
99 

99.99 
99 

1366.952 



99 

99.99 
99 

99.99 
16380 



/ name of this ASCII table extension 
/ Undefined value SOLAR_CT 
/ Undefined value STATUS 



99.99 
99 

99.99 
99 

1366.827 



99 

99.99 
99 

99.99 
16380 



99.99 
99 

99.99 
99 

1366.856 



1366.790 16380 



1366.870 16380 



1366.781 16380 



1366.958 

16380 

99 



16380 

1366.795 

99.99 
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Figure 9.4: Abridged example DIARAD data file. In this case, the file 
corresponds to the measurements taken on the 1/1/2000. 



Substituting this into equation (9.13) and recasting into physical variables 



gives 



a f = 103.7s 



100(4, + A m ) 



(l3hr J 



x [ 0.010 + 0.277 (^] +0.714 



(9.31) 



Using this result, some general comments can now be made about the be- 
haviour of €j for the case where the light curve is dominated by realistic 
stellar photometric noise. 
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(e) T oba = 12hr. (f) T obs = 24hr. 

Figure 9.5: The observationally determined distribution of €q (red bar) for 
the case of realistic solar photometric noise for six different length observing 
windows. Note that for observing windows less than 12 hours long, the 
distribution of €q strongly represents a normal curve (black line). 
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± obs 


At = 3 mins 


At = 30 mins 


30 mins 


9.492 x 10" a s 




1 hr 


1.462 x 1(T 2 s 


1.261 x 10~ 2 s 


2 hr 


2.326 x 1(T 2 s 


2.259 x 10~ 2 s 


4 hr 


4.18 x 1(T 2 s 


4.18 x 10~ 2 s 


8 hr 


9.90 x 10~ 2 s 


9.90 x 10~ 2 s 


12 hr 


1.77 x 10- 1 s 


1.79 x 10- 1 s 


16 hr 


2.88 x lO" 1 s 


2.88 x lO" 1 s 


24 hr 


5.7 x lO" 1 s 


5.7 x lO" 1 s 


36 hr 


1.1 s 


1.1 s 



Table 9.2: The size of <tq as a function of length of observation window, T b s , 
and exposure time At. The value of <tq is recorded to the last significant 
figure. 



9.4.3 Properties of e. 



As for the case of white noise, the properties of €j will be discussed us- 
ing equation (9.31) in terms of /3, At, A„ + A m , T tra and T b s . Then the 



discussion will proceed to a more general investigation in terms of Ar and 
3- 



The first term in equation (9.31), describes the effect of the amplitude 



of the photometric noise relative to the amplitude of the photometric noise 
of the Sun, through the term f3. Unsurprisingly, the larger the amplitude of 
the intrinsic photometric noise of the star, the larger the resulting error in r. 
This is equivalent to the linear dependance of ej on gl for the case of white 
photometric noise. Also, as discussed previously, for the case of realistic 
solar photometric noise, the distribution of ej does not seem to depend on 
the exposure time. This again is unsurprising as the noise is inherent to the 
host object, not on the way in which it is measured, so altering the exposure 
time merely acts to smear out the photometric noise, not alter its form or 
amplitude (or its effect on ej). 

Similar to the case of white photometric noise, the second term of equa- 



tion (9.31) represents the effect of the relative size of the planet compared 



to the star. As this dependance is a property of equation (9.31), it should 
not be surprising. 



The third term of equation (9.31) describes the dependance of ej on 



planet-star separation and planet-moon separation through the transit du- 
ration Tt ra and the observing duration T Q i, s respectively. As mentioned in 
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Duration of Transit (hr) 



Figure 9.6: Comparison between calculated errors in r using solar light 
curves (red dots) and theoretically predicted errors in r using white noise 
with the same power as the noise in the solar light curves (thick line). The 
method used to derive this white noise amplitude is presented in appendix 
[K} As the results for 3 minute and 30 minute exposures are so similar, only 
the data for 3 minute exposures is shown. These relations are shown for 
the case of the transit of a gas giant ((A p + A m ) / L^N^a = 10~ 2 ) and a 
terrestrial planet {{A p + A m ) / LoNt ra = 10 -4 ), assuming T f, s ~ 2Tt ra - 



the section on white noise, the optimal observing duration scales as the tran- 
sit duration for a given planet-moon pair. Applying this to equation (9.31) 
we find that Ej is a superlinear function of transit duration. In addition, it is 
also a superlinear function of observing duration (a good proxy for planet- 
moon distance). Comparing these results to the relations for the case of 
white noise (e j oc T^J a and Cj oc T f J JS ) and to the amplitude dependance of 
At (At oc Ti ra ), it can be seen that the addition of realistic photometric 
noise to a light curve makes a substantial difference to moon detection. In 
particular, for the case of realistic photometric noise it would be easier to 
detect moons which are closer to their host planet and to detect moons of 



215 




10 20 30 40 50 60 70 
Time (days) 



Figure 9.7: Unbiased autocorrelation function of €j (red line) calculated 
using realistic stellar noise for the case where the planet is Jupiter-like {{A p + 
A m ) I 'LoNfra = W~ 2 and T b s ~ 2T tra ) and where the observing window is 
8 hours. For reference, the one sigma error bars (pink) are also shown. As 
short period planets are more likely to be discovered by the transit technique, 
the inner section of the autocorrelation function is shown in the main plot. 
For completeness, the full autocorrelation function is shown in the inset. 



planets with shorter transit times, due to, for example, an inclined orbit. 

As with the case of white noise, equation ( |9.31 ) does not depend on 
At, again as these dependancies were neglected (as a result of their small 



contribution) in the derivation of equations (7.60) and (9.10). By analogy 



with the case of white noise, we would expect this discrepancy to become 
large for large At, or for the case where the noise amplitude is large relative 
to the transit depth (that is, for large j3). 

In addition to determining the distribution of €j, it is also important 
to determine whether or not consecutive values of €j are correlated. Con- 
sequently the autocorrelation 12 of the sequence of e,- values was computed 



12 For this work the autocorrelation is defined as Ylj £j£j+k/Nk where tj and tj+k rep- 
resent all tj pairs separated by k x 8 hours and Nk is the number of such pairs. This 
unbiased definition was selected to allow ease of comparison between values calculated for 
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(see figure 9.7). From physical intuition, we would expect that there would 
be some correlation on timescales comparable with the rotational period of 
the Sun (~ 25 days) as the gradient of the luminosity is also correlated over 



those timescales. As can be seen from figure 9.7, e,- is effectively uncorre 



lated for all transiting planets with orbital periods above forty days. As 
the shortest interval between consecutive transits for a planet with stable 
moons around a Sun-like star is approximately one month, only the region 
of the autocorrelation curve corresponding to orbital periods greater than 
one month need be examined. Consequently, for nearly all planets likely to 
host moons, €j is uncorrelated. 

Now that the case of raw realistic stellar noise has been investigated, the 
analysis can be refined by "filtering" the data to remove long term trends. 
In the case to be investigated these trends are due to the advection of active 
regions across the solar surface. 



9.5 Filtered red noise: observational derivation 
9.5.1 Introduction to filtered noise 

As the presence of red noise in light curves significantly alters the detection 



probability of transiting planets (e.g. Borucki et al. 1985, Pont et al. 2006), 
a range of methods for reducing the effect of this type of noise have been 
investigated. These include methods where the filtering is performed in 



combination with a transit finding algorithm (e.g. Jenkins, 2002), methods 
where the red noise is preprocessed with a whitening filter before transit 



detection (e.g. Carpano et al. 2003; Guis & Barge, 2005 Moutou et al 



2005 team 3), and methods where the physical processes believed to be 



underlying the red noise are modeled (Lanza et al. 2003). 



Since for this application moon searches will be conducted on a star 
by star basis, it was decided to use a method which modeled the physical 
processes appropriate to each star. In particular it was decided to use the 



three spot model method of Lanza et al. (2003) as: 



This method was developed using SOHO data. 

It was designed to be adapted to model photometric variation for any 



Sun-like star (Lanza et al. 2004) 



It has been used to model variability in real stellar data, specifically, 



different lag times. 
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the COROT targets CoRoT-Exo-2 ( [Lanza et ak} |2009b[ ) and CoRoT- 
Exo-4 (|Lanza et "aT| |2009a"|) . 



For completeness, the particulars of this method will be discussed. 



9.5.2 Description of the three spot model 



One source of long term photometric variability of the Sun is rotational 
modulation of the active regions, that is, regions on the Sun's surface where 
the magnetic field strengths are high. Active regions are complex structures 
comprising of clusters of magnetic phenomenon such as darker sunspots and 
brighter faculae. While there is a log normal distribution of active region 



sizes (Bogdan et al. 1988), Lanza et al. (2003) determined that the long 



term photometric variation of the Sun could be effectively modeled by only 
accounting for three distinct active regions on the face of the star in any 
given fourteen day window. 

To perform the fitting required to implement this "three spot model" the 
extent to which the substructure of the active regions needs to be modeled, 
must be determined. In particular, the issues of active region composi- 
tion, evolution and distribution across the solar surface need to be explored. 
While the ratio between the area of faculae and starspots changes as a func- 



tion of active region size and position in the solar cycle (Chapman et al 



1997), Lanza et al. (2003) assumed a constant ratio of sunspot area to facu- 



lar area of 1:10. Consequently this assumption was also used for this thesis. 
Not only are active regions composed of complex structures, these structures 
evolve over time. Fortunately, the timescale for active region evolution is 
longer than that for solar rotation. Consequently, the solar intensity data 
can be divided into blocks, such that it can be assumed that the position and 



size of an active region doesn't change within the block. Lanza et al. (2003) 



found that a good compromise between modeling the effect of solar rotation 
and minimising active region evolution was 14 days. 13 Consequently, for 
this thesis, data was divided into 14 day segments, each consisting of 6720 
data points. In addition, as active regions are generally much smaller than 
the radius of the Sun, the entire active region can be modeled as having the 
same fi value, where the \i value is given by the cosine of the angle between 
the surface normal and the direction of the line-of-sight at that position 
on the solar surface. Consequently, three variables are required to model 



13 The optimum block length depends on the characteristics of the host star. For exam- 
ple, for the case of the Sun the optimal length is 14 days, while for the case of CoRoT-Exo-4 



a length of 8.2 days (Lanza et al. 2009a I was used 
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each active region, two describing the position of the region on the face of 
the Sun, and one variable describing the effective area of the active region. 
With these assumptions and simplifications in mind, the fitting model can 
be introduced. 



Following Lanza et al. (2003), this results in the following model for L, 



the photometric intensity of the Sun 



L - L = L r + LqC ^ Aim K + b p^ + c v$\ 

i=l:Hi>0 

x [(c s - 1) + Q(c f + d f m - 1)] , (9.32) 
where the m correspond to the \x values of each of the star spots, where /Zj 



will be defined in equation (9.33). For this study the parameters relating 

j 



to the intrinsic properties of the Sun, Lq, C, a p , b p , c p , 



Q, 



were set to the same values used in Lanza et al. (2003). For completeness 



Cf and 



7 



the values and the purpose of each of these model parameters will be stated 
and explained. The limb darkening parameters, a p , b p , c p and C were set 
to 0.36, 0.84, -0.20 and 4.88 respectively. In addition, the parameter c s , 
which describes the dependence of sunspot luminosity on position was set 
to 0.67, and the parameters c/ and which describe the dependence of 
facular luminosity as a function of position were set to the solar values of 



1.115 and -0.115 (Foukal et al. 1991 ). Q, the ratio of sunspot area to faculae 



area was set to 10. As discussed in Chapman et al. (1997), this ratio can 



change throughout the eleven year solar cycle. 

In addition to the variables remaining constant for all fits, eleven vari- 
ables are allowed to change. First the additional background intensity is 
described by L r . Second the relative areas of the three starspots are given 
by Ai, A2 and ^3. Finally, the effect of the position of the star spots is 
parameterised by their \i value, defined as 

fii = cos i sun sin 0{ + sin i sun cos 9{ cos(A« + Qt — Lq) (9.33) 

where f2 is the angular velocity of the Sun due to its rotation and Aj and 9i 
are the longitude and latitude of the i th starspot. Consequently, the eleven 
variables which must be fitted for this model are A%, Ai, 0\, A2, A2, 62, A3, 
A3, 63, L r and O for each 14 day segment of data. The fitted value of these 
variables is such that the x 2 value, defined as 



1 



X 



M 

M 2 



(L(ti)-L m (ti)f 



or 



(9.34) 



219 



where L is the observed luminosity, Lja is the fitted luminosity and where 
a = 2 x 10~ 5 Lo, is minimised. 

In addition to fitting these eleven variables, there are a number of con- 
straints imposed by the physics of the problem on the values the variables 
can take, in particular with respect to starspot areas and the rotation rate. 
The values of the starspot areas were constrained to be positive and below 
a threshold size. Lanza et al. (2003) calculated this threshold size to be 



8.2 x 10 -4 by assuming that the largest dip in the data was due to the rota- 
tional modulation of a single active region. However, as the span of SOHO 
data used for this thesis is longer than that used by Lanza et al. (2003), the 
upper limit derived may not be able to explain all the behaviour in the light 
curve analysed. In particular there is a region of apparently high sunspot ac- 
tivity occurring between 15/10/2003 and 11/11/2003 containing dips which 
correspond to spots with a relative area of 1.7 x 10~ 3 . Consequently, the 
upper limit of 8.2 x 10 -4 on starspot area was used for all 14 day blocks 
except for the two blocks between 15/10/2003 and 11/11/2003, where the 
limit 1.7 x 10 -3 was used. In addition to starspot area, solar rotation rate 
is an observationally constrained quantity. For this work, the limits of 23.0 
days to 33.5 days on the solar rotation period, derived by Lanza et al. (2003), 
were used without modification. 

For this thesis, the simplex method described by Lagarias et al. (1998), 
implemented using the multidimensional Matlab fitting function f minsearch 
was used to complete this constrained minimisation. This approach required 
that starting values and a function for calculating x 2 were provided to the 
method. The starting values for each fit were either taken from the previous 
fit, or manually estimated. As suggested by Lanza et al. (2003), the vari- 
ables, A\, Ai, 6*i, A2, A2, #2, ^3, A3, #3 and L r were fitted independently 
to O, which was then optimised. In addition, to ensure that the starspot 
areas remained within the physical bounds, the function which returned the 
X 2 value was modified such that it returned a very large value if active re- 
gion areas became too large or negative. Unfortunately, while these inputs 
resulted in a set of acceptable values for the fitting parameters being re- 
turned, issues with the simplex method itself, such as the fact that it may 
not converge (e.g. McKinnon, 1999), required that a number of tests were 
conducted to ensure that the true minimum had been found. 

To ensure that the fit had converged, three checks were made. First, each 
fit was checked by eye. Second, for each block of data, the fitting procedure 
was repeated, that is the output of the preceding fit was used as the starting 
variables for the next fit, until the fitted variables remained approximately 
the same between two successive fits. Third, the dependence of x 2 on each 
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of the eleven fitting parameters was checked for 29 randomly selected blocks. 
All curves inspected indicated that for these blocks the simplex method had 
converged to the x 2 minimum. 

Once the fitting had been performed, the best fit parameters were recorded 
in a text file. These parameters were in turn used to produce a model light 
curve. By subtracting the model from the data and then adding the average 
intensity, a detrended light curve was constructed. 



9.5.3 Derivation of 6j 

This detrended data was then subjected to the same procedure as that of 



the unfiltered red noise (see section 9.4) to determine function of 



transit area, transit duration and exposure time. The results are presented 

Again, as the data for 3 and 30 minutes are 



in table |9.3| and figure |9.9[ 
effectively equal, only the three minute data was used. Fitting a line to 
these data points gives 



(TQ 



-2.16 x 10~ 3 s + 1.21 x l(T 3 s hr _1 T t 



obs ■ 



(9.35) 



Substituting this into equation (9.13) and recasting into physical variables 



gives 



a f = 53.2s I 



100 (A p + A 



( Ttra \ 

Vl3hr7 



LoN tra 
-1 / 



8 x lO- 3 ( ^) + 1.008 ( 



(9.36) 



As with the previous two types of noise, the behaviour of e,- for the case of 



filtered photometric noise will be discussed in the context of equation (9.36). 



9.5.4 Properties of e. 



The effect of filtered photometric noise on e,- will be discussed with respect 



to the terms in equation (9.36) and with respect to At and j. In particular, 



these results will be compared to the case of unfiltered noise. 



Again, from the first two terms, equation (9.36) depends linearly on (3 
and is inversely proportional to A p + A m . As these dependancies have been 
found for the case of white and realistic stellar noise, they will not be further 
discussed. However, the third term shows some difference. 

The difference between the cases of filtered and unfiltered realistic pho- 



tometric noise become apparent in the third term of equation (9.36), which 
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Figure 9.8: The observationally determined distribution of 6q (green bar) for 
the case of filtered solar photometric noise for six different length observing 
windows. Note that the distribution of 6q strongly represents a normal 
curve (black line). 
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T f, s At = 3 mins At = 30 mins 

30 mins 9.4764 x 10 _:J s 1 

1 hr 1.446 x 10~ 2 s 1.246 x 10" 2 s 

2 hr 2.255 x 10~ 2 s 2.179 x 10~ 2 s 
4 hr 3.77 x 10~ 2 s 3.78 x 10~ 2 s 
8 hr 7.52 x 10~ 2 s 7.38 x 10~ 2 s 
12 hr 1.16 x lO" 1 s 1.16 x 10" 1 s 
16 hr 1.64 x 10" 1 s 1.64 x 10" 1 s 
24 hr 2.7 x 10" 1 s 2.7 x 10" 1 s 
36 hr 4.2 x 10" 1 s 4.2 x 10" 1 s 

Table 9.3: The size of cjq as a function of length of observation window, T b s , 

and exposure time At. The value of <tq is recorded to the last significant 
figure. 



describes the behaviour of ej as a function of planet-star and planet-moon 
distance through Tt ra and T b s . Noting that the T D b s /24hr term in the Taylor 
expansion is dominant, a e is proportional to Ti ra . As the signal amplitude is 
also proportional to T tra , this means to first order, the detection threshold 
does not depend on the transit duration (and thus the planet-star distance) 
for the case where the light curve is contaminated with filtered realistic so- 
lar photometric noise. Similarly, the amplitude of ej is proportional to T 2 6s 
for constant transit duration where the observing window is being altered 
to look for more distance moons of a given planet. Again this physically 
corresponds to very close and very distant moons being undetectable. 

Finally, as for the cases of white and realistic photometric noise, the 
dependance of ej on At was negligible. In addition, the autocorrelation (see 
figure 9.10) was again calculated, and it was found that ej is again effectively 
uncorrelated from transit to transit. 

Now that we have investigated the properties of ej for the case of filtered 
noise, we will summarise the behaviour of ej resulting from the different 
types of noise before continuing with the analysis. 



9.6 Conclusion 

As discussed, ej values resulting from white, realistic and filtered realistic 
photometric noise show quite similar behaviour in some ways, for example, 
their dependance on A p + A m , but quite different behaviour in others, for 
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Figure 9.9: Comparison between calculated errors in r using solar light 
curves which have been "filtered" using the method of Lanza et al. ( 2003 ) 
(green dots) and theoretically predicted errors in r using white noise with 
the same power as the noise in the solar light curves (thick lines) . In addition 
the errors in r for the case of unfiltered noise are also plotted for comparison 
(red dots). These relations are shown for the case of the transit of a gas giant 
((A p + A m ) / L N tra = 10~ 2 ) and a terrestrial planet ((A p + A m ) / L N tra = 
10 -4 ), assuming T obs « 2T tra - 



example, their dependance on Tt ra . These differences in behaviour will be 
summarised in turn, with the aim of highlighting the results required for 
later work. 

In general, the amplitude, form or behaviour of €j does not depend on 
exposure time. The only case where it does (among the three noise sources 
investigated) is for white noise processes which are dependent on the be- 
haviour on the instrument, e.g. read noise. For these cases, the exposure 
time should be optimised with respect to instrumental specifications and 
not the physics of the planet-moon system. 

For all three types of noise, the amplitudes of tj were found to scale 
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Figure 9.10: Normalised autocorrelation function of €j (green line) calculated 
using filtered stellar noise for the case where the observing window is 8 hours. 
For reference, the one sigma error bars (light green) are also shown. As short 
period planets are more likely to be discovered by the transit technique, the 
inner section of the autocorrelation function is shown in the main plot. For 
completeness, the full autocorrelation function is shown in the inset. 



linearly with the amplitude of the noise, while the proportionality constant 
depended on the specific type of noise (white, realistic or realistic filtered) 
and the physics of the system. As this is a property of equation ( 9.13| ), it 
should hold true for all types of photometric noise. 

The behaviour with respect to T tra and T Q b s was found to vary signif- 
icantly with noise type. For the case where the planet-star distance was 
altered without affecting the transit geometry, that is, altering Tt ra , the de- 
tectability of moons increased with, decreased with and was independent of 
planet-star distance for the case of white noise, realistic stellar noise and 
filtered realistic noise respectively. For the case where the planet-moon dis- 
tance was altered (that is, T f, s changes but T tra does not) all these three 
noise types show similar behaviour, in that there is an optimum planet- 
moon distance for moon detection, and moons that are closer or further 
away than this are less detectable. In particular, this optimal distance is 
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largest for white noise, smallest for red noise and intermediate for filtered 
noise. The exact position will depend on the star and the transit geometry, 
an issue that is further investigated in the next chapter. 

Finally, for realistic planet-moon systems, the distribution of €j can be 
well approximated by a normal distribution for the case of white and fil- 
tered realistic photometric noise. For the case of realistic photometric noise, 
the distribution of Ej is well approximated by a normal distribution for ob- 
serving window lengths shorter than about 12 hours, but becomes slightly 
non- normal for the case of observing windows longer than about 12 hours. 
In addition, the values of tj corresponding to a sequence of transits are un- 
correlated, and only very weakly depend on the values of At corresponding 
to the that transit. These three very useful statistical properties will be 
exploited in the next chapter. 

Now that the effect of three physically sensible noise sources have been 
investigated, we can now combine the results from chapter [8] with those of 
this chapter to produce detection thresholds. 
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Chapter 10 



Detection Threshold for 
Moons of Transiting Planets 



10.1 Introduction 



Now that At, the form of the TTV P perturbation has been derived, and 
the behaviour of ej, the timing noise, has been explored, we are finally in a 
position to determine the set of moons that can be detected using the TTV P 
technique. We begin by first motivating and defining the set of planetary 
systems that will be explored in this chapter. Then the method used to de- 
termine these thresholds will be discussed and the mathematics required to 
calculate these thresholds introduced and defined. Then using these formu- 
lae, the general behaviour of the detection threshold will be explored using 
simplifications that occur when the number of transits is large. This analy- 
sis will then in turn be used to perform a comparison between the detection 
threshold calculated using the TTV P technique and the detection thresholds 
corresponding to the three other transit moon detection methods calculated 
and discussed in section 4.3.2 Finally the TTV P moon detection threshold 
will be numerically calculated for realistic systems to investigate the effect 
of inclination, eccentricity and orientation of the planet's orbit for the case 
where the number of observed transits is finite. However, to perform such 
an investigation, the region of parameter space to be investigated, must be 
defined. With this in mind, we begin with a discussion of the region of 
parameter space to be explored. 



227 



10.2 Region of parameter space considered 



While it would be nice to have a simple two-dimensional plot which describes 
moon detection in terms of all moon detection behaviour, this is not possible, 
as the detection thresholds depend on many more than two parameters. In 
particular, they depend on the physical and orbital parameters of the star, 
planet and moon in question. Thus, the variables to be investigated have 
to be selected very carefully to make best use of this wealth of information. 
Consequently we will discuss the selection of variables relating to moon 
detection with respect to the star, planet and moon in turn. 

Transiting planets have been found around stars with radii ranging from 
0.2i?Q to 2.1.Rq. However, while the physical parameters of stars along 
the main sequence can be easily derived (e.g. R s oc M s ), the behaviour of 
the photometric noise of stars as a function of their position on the main 
sequence is not a simple function of their physical parameters. 1 As the 
analysis conducted in chapter [9] into realistic and filtered photometric noise 
only applies to the Sun, it can only be extended to other solar-like host 
stars. Consequently, in this work it was decided to investigate the case of 
a 12 magnitude Sun-like host star with photometric noise which is either 
dominated by white, realistic solar or filtered solar photometric noise, with 
amplitude defined by the Kepler reference case. 2 In particular, for the case 
of realistic solar or filtered solar photometric noise, this involves selecting a 
scaling factor f3 = 1.9. 

The planet can also influence the moon detection thresholds through 
inherent properties such as mass and radius, and through properties of its 
orbit. As discussed in chapter [9j planetary radius does not strongly affect 
detection thresholds as both the amplitude of At and €j are inversely pro- 
portional to A p + A m . Consequently, for this work, moon detection will not 
be investigated with respect to the radius of the planet. The mass of the 
planet is a different story. While planetary mass does not directly affect 
either the amplitude of At or the characteristic size of e,-, it is important 
as it alters the position where the assumption that v m <C vt r (required for 
deriving the expression for At) breaks down, and as predictions from moon 



Aigrain et al. ( 2004 I proposed a method for predicting the inherent photometric noise 
spectrum main sequence stars of any spectral type. However, in order to construct this 
method, they had to assume that some of the noise parameters of stars in general were 
the same as that of the Sun (as the parameters were poorly constrained). 

2 A relative photometric precision of 2 x 10 -5 over a 6.5 hour exposure (e.g. Borucki et 
al., 2003). For the case of white noise this translates to a relative photometric precision 
of 7.2 x 10 -5 for a thirty minute exposure. 
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stability and formation theory depend on planetary mass. As these quanti- 
ties are useful for comparison purposes, the effect of the mass of the planet 
will be investigated. The orbital parameters of the planet's orbit can also 
affect the detection threshold, in particular a p , I p and e p . a p alters moon 
detection both by altering the transit duration through altering v tr , but also 
by altering N, the total number of transits that will be observed (as the or- 
bital period of the planet depends on a p ). Consequently, the effect of a p on 
the thresholds will be investigated. In addition, e p and I p alter moon de- 
tection by altering the transit duration by altering vt r and the chord length 
respectively, and their effect on the detection threshold will be investigated 



specifically in sections 10.6.3 and 10.6.4 Consequently, planets will be in- 
vestigated in terms of M p and a p . In particular, the values of M p and a p 
that will be investigated will be 10Mj, lMj, IMjj and 1.M®, to cover the 
full range of available planets, and a p = 0.2AU (the closest and thus the 
most detectable planets that can host moons), 0.4AU and 0.6AU (the most 
distant planets that can host moons that are detectable by TTV P using 
Kepler) . 

Finally, the moon itself can (unsurprisingly) affect its detection threshold 
through its physical properties and its orbital parameters. The physical 
properties a moon can possess include its size (parameterised by its radius) , 
and its mass. As detection thresholds depend predominantly on moon size 



and not mass (see section 8.4.1), moon detection will be investigated in 
terms of moon radius. While moon radius cannot be simply determined 
from A m , it can be approximated. |Sartoretti Schneider (1999) looked 
at the error in assuming that A m / L^Ntra = (Rm/Rs) 2 and found that it 
was small for all but the most extreme values of I p . Consequently this 
approximation will also be used for this work. Also, as some of the moon 
formation and stability limits are formulated in terms of a moon mass as 



opposed to a radius e.g. the mass limit proposed by Canup & Ward (2006) 



(see section 3.3.2), it would be useful to have a conversion factor (i.e. a 
density) so that mass limits can be written in terms of moon radii. As all 
models are well inside the snow line, it was decided to use a density of 3000 
kgm -3 . For comparison, the moon's density is 3346 kgm -3 and Callisto's is 
1860 kgm -3 . The orbital parameters of the moon, in particular, d m , 6 m , I m 
and /m(0) +w m , may also affect moon detectability. As this is a preliminary 
investigation, we will not be looking at the effect of e m and I m on the TTV P 
moon detection threshold, and only consider moons on circular orbits which 
are aligned to the line-of-of sight. In addition, the effects 3 of / m (0) +uj m on 



In addition to the non-detection spikes associated with a m there are also non-detection 
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the detection threshold will be side-stepped in this work as first, the shape of 
the detection threshold becomes independent of f m (0) + uj m as N — > oo and 
second, as <j) = /m(0) + w m will be randomly selected for the Monte Carlo 
simulation conducted in section 10. 6| Consequently we will also investigate 
moon detectability in terms of a m . 

So in summary, the detection thresholds will be investigated in terms 
of M p , a p , R m and a m . As moons will be detected for a given planet, the 
thresholds will be displayed in a three by four grid of planetary mass and 
semi-major axes. Thus each of the twelve plots represents a host planet 
with a given value of M p and a p . The plots themselves show moon detection 
thresholds as a function of moon radius and semi-major axis. Now that the 
range of parameter space to be investigated has been discussed, the method 
for constructing these threshold maps needs to be decided. 



10.3 Method 

In order to numerically determine moon detection thresholds using TTV P , 
three main issues must be addressed. First, in order to calculate the value of 
and error in r for each transit, a particular observation window must be used. 
As discussed in chapter [7j for this thesis, we will use the smallest window 
which will always include the moon's transit. As we are now in a position 
to be be able to discuss this choice, the selection of this particular window 
will be revisited. Second, a method needs to be selected for determining 
the detection threshold. Finally, these two choices must be mathematically 
implemented in order to provide expressions which can give the detection 
threshold of a given moon. We begin with a discussion of the selection of 
the observation window. 



10.3.1 Selection of observation window 

As discussed in the introduction to this Part, for this thesis, t will be cal- 
culated using a section of light curve centered on the planetary transit and 
of length 

T obs = T tra + 2am{1 + em) . (10.1) 

Vtr 



spikes associated with <f> — / m (0) + ui m . For example, consider the case where uj — n and 



(f> — 0. From equation (8.1071 we have that At is given by the sequence A, — A, A, —A . . . , 
a signal which could be detected. Contrast this with the case where uj — n and <j> = tt/2. 
For this case Ar is given by the sequence 0, 0, 0, . . . , a signal which is not detectable. 
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Figure 10.1: Cartoon of a sequence of five light curves caused by the transit 
of a planet-moon pair for the case where the observing window is too short 
to always contain the transit of the moon. The beginning and end of the 
observation window are denoted by thick vertical lines, and each light curve 
is centered on the planetary transit. The region of the dip caused by the 
planet or moon inside the window and and consequently included in the sum 
used to calculate r, is crosshatched in black, while the region not included 
in the sum is crosshatched red. 
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Time 



(a) White noise. 



Time 



(b) Corellated noise. 



Figure 10.2: Diagram demonstrating how the the measured values, best fit 
function and errors in r change with observation window size and the type of 
photometric noise contaminating the light curve, for the cases of white and 
correlated photometric noise. A typical light curve (black dots) is shown 
at the base of each figure along with a cartoon theoretical light curve for 
the case of no photometric noise (thin line). In addition, on this light curve 
two concentric observing windows are indicated by bold vertical lines, the 
shorter inner window beginning at the grey line on the left and finishing at 
the grey line on the right, and a longer outer window beginning at the black 
line on the left and finishing at the black line on the right. The diagram 
above shows measured r values (crosses) as a function of transit number 
along with the best fit r function (dashed line) for the case where the grey 
observing window and black observing window have been used to calculate 
r. In particular, the transit light curve shown corresponds to the circled 
point on the diagram above, where the r values calculated using the grey 
observing window and the black observing window are indicated by a grey 
and black cross respectively. 



232 



This particular length window was selected as it is the smallest window 
which is ensured to always contain the moons transit. Now that the be- 
haviour of the timing perturbation and the noise have been discussed, we 
are finally in a position to revisit this assumption, and discuss why it was 
made. 

Realistically, one would like to select a concentric set of windows of 
differing sizes, check each one for signs of a moon, and then combine the 
results to give a statistical description of the types of moons that could or 
could not be detected. Unfortunately, there are two problems with this. 
First, if the window selected is too small, sections of the moon's transit 
may be neglected for some of the transits. Second, if multiple windows are 
selected, the calculated values of r are not statistically independent and it 
is difficult to construct thresholds. These issues will be discussed in turn. 

For the case where the window selected does not always include the all 
of moon's transit (see figure 10.1), problems can arise. In particular, these 
problems occur for transits where a section of the moon's transit is not 
included e.g. transits 2, 4 and 5 of figure [lOTTj For these cases, the approxi- 
mate expressions for At derived in chapter [8] are no longer correct and can 
no longer be used to determine thresholds. In addition this problem is fairly 
insidious as it only acts on the transits with large planet moon separations 
(and thus large At) and always acts to reduce the size of At. Unfortunately, 
there are also issues with using a concentric set of large windows to try and 
find a window with the correct size. 

To begin, we consider the effect on the measured values of r, and conse- 
quently on the derived thresholds, of moving from an observation window, 
centered on the planetary transit and long enough to always include the 
moon's transit to a longer observation window which is also centered on the 
planetary transit. To provide a context for this discussion, consider the two 
parts of figure 10.2 Figure 10.2(a) shows a typical sequence of r values as a 
function of transit number for the case of two different length observations 
windows (black and grey). In addition, the best fit r function and an ex- 
ample transit light curve containing white noise are also shown for the two 
cases. Figure 10.2(b) shows the same thing but for red noise. The discussion 
will be conducted using these two examples 

Beginning with the case of white noise, consider figure 10.2(a) , and in 
particular consider the circled point which corresponds to the example tran- 
sit light curve. Moving from using the shorter observing window (grey) to 
the longer window (black) to determine r, involves using all the points in 
the grey window, but also including the six extra data points (three from 
the left and three from the right), along with the additional noise associated 
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with them. Importantly, as the sum in the black window also includes all 
the points in the grey window, the error in r for both windows will not be 
independent. 4 In addition, this will be true for each of the transits, and con- 
sequently for the case of the black window, the best fit curve and whatever 
measures of goodness of fit are selected, will depend to an extent on the best 
fit curve and goodness of fit for the case of the grey window. As a result 
of this correlation, it is not easily apparent how to simply and accurately 
determine the statistical significance of a positive detection in one of the 
windows. 

For the case of correlated noise, the situation is even more complicated, 



for example consider the example light curve shown in figure 10.2(b) In 



this example, the correlated photometric noise can be seen as an downward 
trend in the light curve. As the dip is deeper later in the light curve, r 
will be biased toward later times and thus will be overestimated (i.e. €j is 
positive). In addition, as the noise is correlated, the new pieces of light curve 
included in the larger window are likely exhibit similar behaviour (resulting 
in an even more extreme value of €j). Thus, it seems reasonable that when 
moving from a smaller observation window to a larger one, the error in r will 
preferentially increase in magnitude, but stay the same sign. Consequently 
the issue of determining statistical significance of a detection in one of a 
number of windows for the case of correlated noise is even less clear. 

While the selection of an appropriate observation window is an important 
issue, it is outside the scope of this thesis as a result of the two reasons 
discussed above. However, as discussed in section 7.4.4 guesses can be 
made about the size of physically realistic windows from formation and 
stability constraints. As a result, for this work we will use equation (10.1) 
to describe the size of the observing window, continue with the analysis, and 
leave further discussion of this issue until chapter 11, within the context of 
future research directions. 



10.3.2 Selection of technique for calculating thresholds 

Now that we have expressions for At, the timing perturbation caused by the 
moon, a description of €j the timing noise, along with a well defined region 
of parameter space for which we would like moon detection thresholds, we 
are in a position to introduce and describe the method used in this thesis to 
calculate detection thresholds. To introduce this method, we first consider 

4 In particular the noise is random walk with the step-size taken from a Gaussian 
distribution with standard deviation &l V^vpo + jT p — t;) 2 , where the sum is carried 
out only over the new sections of window. 
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what detecting a moon means statistically, and formulate this into a null 
and alternative hypothesis. This description is then used to motivate the 
choice of statistical method used, likelihood radio testing, and in particular 
generalised likelihood radio testing. Finally, the expressions required to 
practically use this method are derived and discussed. 



10.3.2.1 Formulating a null and alternative hypothesis 

For this work we would like to differentiate between the case of "detecting" 
a moon, and not "detecting" a moon, where the "detection" depends on 
the statistical threshold selected. For the case where there is no moon, we 
expect the transit timings to be strictly periodic. Formulating this as the 
null hypothesis we have that 

Ho-.T^to+jTp + ej, (10.2) 

where ej is normally distributed with zero mean and known constant stan- 
dard deviation a e . For the case where there is a moon, we expect the transit 
timings to be the sum of the above linear function with a low amplitude 
sinusoidal perturbation. Formulating this as the alternative hypothesis we 
have that 

Hi : tj =t + jT p + A cos(wj + <p) + €j, (10.3) 

where ej is again, normally distributed with zero mean and known constant 5 
standard deviation a e . So, in a purely qualitative sense, the process of 
determining if a moon is detectable in a given sequence of data is the process 
of determining to the relative probability that the data would occur under 
H or Hi. 



10.3.2.2 Introduction to likelihood ratio testing 

A statistical method exists, likelihood ratio testing, which uses just this 
approach to determine if Hi is a better description of the data than Hq. In 
particular, the quantity calculated, A is the probability that the observed 
data were generated from the model corresponding to the null hypothesis 
divided by the probability that the data were generated from the model 
corresponding to the alternative hypothesis. In addition, it can be shown 



5 Near the detection threshold A is of the same order of magnitude as cr £ . As the second 
harmonic of At is being neglected as it's amplitude is of order v m /vtr times smaller than 
A, the perturbation in cr e due to the (A p +A m ) term must also be neglected as the changes 



are also of order v m /vtr times smaller than A v and A m (see equations ( 8 



and (8.69l). Thus a t can be considered constant for this application. 
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that this method is the optimal 6 statistical method for selecting between 
models for a broad range of model types (e.g. Rice, 1995 p. 303). To 
motivate and provide intuition with respect to this method we consider the 
case where to, T p , A, u and eft are known a priori (a useful, but unphysical 
assumption) and where we have a sequence ti, T2, . . . ,tn of recorded data 
points that we wish to test. 

To begin, consider the first data point t\. According to the null hy- 
pothesis, t\ should be normally distributed with mean to + T P and standard 
deviation a e , that is, it should have a probability distribution 

i (n-ttp+Tp)) 2 

P(ri) = ^? . (10.4) 

a e v 27T 

Consequently, assuming this model is correct, the probability that a value 
between t\ and t\ + 5t is measured is given by 

P(ri)<5r = — -=e 2CT ? St. (10.5) 
cr e v27r 

Similarly for the case of the alternative hypothesis, t\ should be normally 
distributed with mean tQ + T p + Acos{uj + (f)) and standard deviation a t , that 
is, have a probability distribution 

I (T 1 -(t +T p + J 4cos(w+</.))) 2 

P( n ) = —=e ^ . (10.6) 

Again, assuming this model is correct, the probability that a value between 
T\ and T\ + St is measured is given by 

1 (T 1 -(t o +T p +Acoa(^ + 0))) 2 

P(tx)St = ^ St. (10.7) 

a e v 27r 

By analogy, it follows that the probability that a value of ranging from 
Tj to Tj + 5t is measured under the null hypothesis, is given by 

1 (7j-(to+jTp)) 2 

P( Tj )6t = ^ St, (10.8) 



while for the alternative hypothesis it is given by 

P(Tj)5T = =e ^ St. (10.9) 

<T e v2vr 



6 The test that is least likely to accept the null hypothesis when it isn't true. 
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Now, as each of the e,'s are uncorrelated (see section 9.6), the probabil- 
ities corresponding to n, T2, . . . , tjv are independent for both the null and 
alternative hypotheses. For the case of independent events, the probability 
that a set of events all occur e.g. each of the t\ to t^ is produced under the 
model, it equal to the product of the individual probabilities, that is 

P(t 1 ,t 2 , . . . ,t n )(St) n = P(ti)6t x P(t 2 )5t x ... x P(t n )St, (10.10) 

N i (^--(to+iip)) 2 
= TT =e ^1 § T (10.11) 

for the case of the null hypothesis and 

P(t 1 ,t 2 ,...,t n )(6t) n = P(tx)8t x P(t 2 )5t x ... x P(t n )St, (10.12) 

N (T j -(t o +jT p +Acos( 1J j + 0))) 2 

=e 2 °i St, (10.13) 



- cr e V2vr 



for the case of the alternative hypothesis. Recalling that the test statistic, 
A, is the ratio of the probability that the data was produced under the 
null hypothesis to the probability that the data was produced under the 
alternative hypothesis, we obtain 



(Tj-(t +jT p )) 2 

■ v - - - - 



Illi ^ Sr 

A = 3 CT "^ 27r do 14 s ) 

(Xj -(tQ+jTp+A cos(^j+<j>)))2 ' K • J 

J- 1.7 = 1 rr,,/9V 



JV (Tj-(t +jT p )) 2 



(10.15) 



^JV (r ] -(t +jTp + A COS (uj] + <t>))) 2 



where the factors of a e V2jr and 5r have been cancelled. As A is the ratio of 
the probability that the null hypothesis describes the data to the probability 
that the alternative hypothesis describes the data, high values (A ~ 1) 
suggest that the null hypothesis is more likely, while low values (A <C 1) 
suggest that the alternative hypothesis is more likely. 

However, as previously mentioned, the system parameters, to, T p , A, uj 
and (j) are not known prior to, or after detection. Consequently, a more 
general version of this method will be used. 
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10.3.2.3 Introduction to generalised likelihood ratio testing 



For this work, generalised likelihood ratio testing will be used to calculate 
detection thresholds. This method has the same principles as likelihood ratio 
testing, except that instead of using known model parameters, the model 
parameters used are those most likely to have produced the data under that 
model for the case of the null and alternative hypotheses respectively. The 
cost of this generalisation is that the technique may no longer be optimal. 
Consequently, for this case, the null hypothesis is given by 

HQ-.T^to+jTp + ej, (10.16) 

where Cj is normally distributed with zero mean and known standard devia- 
tion a e , and where to an d T p are the coefficients derived from a linear least 
squares fit. 7 Similarly, the alternative hypothesis is given by 

Hi : Tj = i + jf p + A cos(u)j + <j>) + ej, (10.17) 

where ej is again normally distributed with zero mean and known standard 
deviation a e , and where to, T p , A, lj and <j) are the coefficients derived from 
a least squares fit for the case of a line plus a sinusoid. In particular, note 
that to, to and to are not necessarily equal, and that T p , T p and T p are also 
not necessarily equal. 8 Consequently, the expression for A is given by 

^AT (Tj-jtQ+jTp)) 2 

e e 

^ ~~ ^JV {T j -(i +jf p +Aco S (u,j + ^)))^ ' (10.18) 

Now that we have an expression for our test statistic A, we need a way to 
determine the statistical significance associated with a particular value of A, 
or as we will see with 21og(A), as it is more useful in practice. To determine 
this statistical significance we consider the distribution of 21og(A) for the 
case where the null hypothesis is true (i.e. that the low value of A occurred 
by chance). 

7 As the errors are normally distributed, the model, r most likely to have produced 
the data is the model which is fitted in the least-squares sense, that is, it minimises 
yZj—i ~ ^j) 2 i where Tj are the values of Tj predicted by the model (see e.g 
1992} p. 651). 

*Bee section 8.4 
formation of non-detection spikes. 



Press et al 
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We begin with equation (10.18) by taking the natural logarithm and 
simplifying to give 



21og(A) = 

3=1 



»( Tj -(i + jT p )f 



J2 ( T i ~ ^ ± &L ± j cos ^' ± ^ (10 19) 



As can be seen, this expression is comprised of two sums, which we will 
consider in turn. First, an expression of the form of the first sum is exactly 
described by a x 2 distribution 9 with N — 2 degrees of freedom, where we 
note that the 2 is due to the two fitting variables (to and T p ). Similarly, 
an expression of the form of the second sum is approximately 10 described 
by a x 2 distribution with N — 5 degrees of freedom, where we again note 
that the 5 is due to the five fitting variables (to, T p , A, Qj and <p). As the \ 2 
distribution describes the sum of the square of a sequence of independent 
normally distributed variables with mean zero and standard deviation one, 
the sum or difference of two variables which are x 2 distributed will also be 
X 2 distributed. Consequently, as 21og(A) is the difference between two vari- 
ables which are approximately x 2 distributed, it will also be x 2 distributed. 
In particular it is approximately distributed as a x 2 variable with three de- 
grees of freedom, 11 an approximation which becomes increasingly accurate 



as N becomes large (e.g. Rice, 1995, p. 310). So, for the case where N is 



large, 21og(A) should be approximately distributed as a x 2 variable with 
three degrees of freedom under the null hypothesis (there is no moon) , and 
thus we can determine the probability that such a large value of 2 log (A) 
(small value of A) occurred due to random chance. For example, the 95% 
confidence limit for the x| distribution is 7.816. Thus, if a value of 21og(A) 
was calculated corresponding to 7.816 we would be 95% sure that this wasn't 
just a statistical deviation, and it actually corresponded to a moon detec- 
tion. For the case where N is small we can simulate a statistically significant 
number of realisations of r, and use these to determine the null distribution 
of 2 log(A). For example, for the case where N = 9 the 95% confidence limit 
is given by 8.45 ± 0.09. 



9 Recall that a x 2 distribution with v degrees of freedom is generated by summing the 
square of v independent standard normally distributed variables. 

10 It is not exactly described by a \ 2 distribution as the fitting formula is non-linear in 
the fitting parameters uj and <j>. 

11 Corresponding to the three degrees of freedom lost by also fitting the sinusoid. 
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Now that we have a method for calculating the detection threshold, we 
can begin to explore the behaviour of this threshold. This will be done 
in three main stages. First, analytic expressions for the location of the 
thresholds will be derived for the case where N — > oo. Then, these will be 



compared to similar moon detection thresholds derived in section 4.3.2 for 
the three other methods presented in the literature. Finally, a Monte Carlo 
simulation will be used to investigate the case of low N, and in this context, 
the effect of N, and inclination, eccentricity and orientation of the planet's 
orbit on moon detection will be discussed. 



10.4 Expected behaviour of detection thresholds 

Armed with the equation determining whether or not a moon is detectable 



in a given sequence of r values, equation (10.19), we now investigate the 
behaviour of the detection thresholds. While equation (10.19) is analytic 
(from 7*1, 7*2, . . ., tjv, the values to, T p , to, T p , A, to and <j> can be calcu- 
lated), it is so complex that is does not provide much in the way of intuitive 
understanding. 

As most of the complexity stems from the finite N nature of the fitting 
process, we will use a two pronged method. First, for the sections of the 
threshold where the finite nature of iV is important e.g. in a non-detection 
spike, we will use the analysis conducted in chapter [8j summarised below for 
convenience. For the regions of the threshold where the finite nature of N is 
not important we will use an approximation assuming large iV to determine 
the gross behaviour. 

10.4.1 Summary of behaviour of non-detection spikes 

For the case where a moon orbits the planet an integer number of times per 
transit, the moon detection threshold will show a non-detection spike. Phys- 
ically, this spike results from the fact that the planet and moon present the 
same orientation each transit and thus produce no transit to transit timing 
perturbation. The distribution and shape of these spikes was discussed and 



investigated in section 8.4.2 and in particular it was found that the moon 
detection threshold should be decorated with a "comb" of non-detection 
spikes, where the spacing between neighbouring spikes is proportional to 
o^ 5 . In addition, each individual spike should be approximately symmetric 
and have width proportional to Na^. Consequently, as N, the number of 
observed transits increases, each spike should remain centered on the same 
semi-major axis, but decrease in width, such that as N tends to infinity, the 
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spikes become infinitely thin, and can be neglected. We now concentrate on 
the shape of the threshold for this case. 



10.4.2 Limit as N -»■ oo 



To investigate the shape of the threshold as N, the number of measured 
transits, becomes large, we begin by assuming that our r values are described 
by 



Tj = t + jT p + A cos(ujj + <j)) + e 



j- 



(10.20) 



where ej, the timing noise, is normally distributed with known standard 
deviation a e . As N increases, we expect that our fitted parameters, to, T p , 
A, Cj and (f>, would tend towards 12 the true values and would asymptotically 
approach them asiV-^ oo. In addition, the fitted values, to an d T p , for the 
linear fit also tend to to and T p as ./V — > oo (see appendix [L| . Consequently, 
equation (10.19) becomes 



1 N 

2 log(A) = £(io + jT p + A cos(wj +(/>) + ej - (t + jT p )) 2 

- {to + jT p + A cos{ujj + <j)) + ej - (t + 3% + A cos(wj + 0))) 2 , (10.21) 
which simplifies to 

N N 

21og(A) = ^ ^ Acos ^ + ^)) 2 + ^E«jMwj + <f>)- (10-22) 



07 



3=1 



Consider the first sum on the right hand side of equation (10.22). As 



all the terms under this sum sign are squares of real numbers, they are 
positive by definition. In addition, as each of these terms are of order A 2 , the 
characteristic size of the sum is N x A 2 /a 2 . In comparison, the second sum 
consists of terms which are the product of a normally distributed variable 
ej, and an expression of characteristic size A. Recalling that the standard 
deviation of the sum of iV independent, normally distributed variables with 
standard deviation a is y/Na, we have that the characteristic size of the 
second sum in equation (10.22) is yN x A/a e . Consequently, for large N 



(N 3> (A/a e ) 2 ), the size of the second sum in equation (10.22) will be 
much smaller than the first, and can thus be neglected. 



"The fitted value of uj may vary from the true value by a multiple of 2tt as a result of 



aliasing. 
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Now, assuming that the orbit of the moon is not in resonance with the 
orbit of the planet, the sequence of in-transit moon positions corresponding 
to the sequence of Acos(uj + <p) values will never repeat. Consequently, 
after a sufficiently large number of transits, all parts of the moon's orbit will 



be sampled equally. Thus, the sum in equation (10.22) can be replaced with 
N multiplied by At 2 , the average value of A 2 cos 2 (ujj + (f>) . Averaging over 
a full orbit, we obtain 



Ar 2 



^ ,UJ {Acos{u:t + (t))) 2 dt 
2-k/u ' 
2tt/uj(A 2 )/2 



2-k/oj 



2 ' 



(10.23) 
(10.24) 
(10.25) 



Finally, the distribution of 2 log(A) is also a function of N. In particular, 
as mentioned previously, as N — > oo, it tends to a x 2 distribution with three 
degrees of freedom. The 99.7% limit for a x! distribution is 13.93. Con- 
sequently, the equation describing the 99.7% threshold for moon detection 



would be given by replacing 21og(A) with 13.93 in equation (10.22). 



Applying these three simplifications to equation (10.22), the following 
expression for the detection threshold is obtained, 



13.95 = N x 



A^_ 
2a 2 / 



(10.26) 



From equation (8.72) we have that for the case of circular and coplanar 



orbits, A, the amplitude of Ar, is given by 



A = cos 



17"mRch \ A. 



± m 



a, 



A p + A m M p + M m Vtr 



(10.27) 



Recalling from section [8.4.2.1 that for moons which are detectable and can 
be described by this analysis, the cosine term in equation (10.27) is approx- 



imately equal to one, and can consequently be neglected. Performing this 



simplification, substituting this into equation (10.26) and rearranging such 
that all the terms involving A p and A m are on the left hand side and all the 
other terms are on the right gives 



Ap -\- A m 



5.28 



a e M p + M m v tr 
iV M p a m 



(10.28) 
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Now, assuming that the planet is much larger than the moon (A p S> A Ti 



and M p 3> M m ) equation (10.28) simplifies to 



A m 
A n 



5.28 



CT e Vtr 



N a r , 



(10.29) 



As discussed in section 10.2, A m /LoNt ra can be approximated by (R m /R s ) 2 . 
Similarly, A p / L^Ntra can be approximated by (R p /R s ) 2 . Consequently, 
A m j Ap can be approximated by R m / R 2 . Using this expression gives 



R!L, — 5.28R 2 . 



N O r , 



(10.30) 



As a e is different for the cases of white, filtered and red noise, equa- 



tion (10.30) will be investigated separately for each of these cases. In par- 
ticular, the fact that moons with a m <C R s and a m 3> R s are not detectable 
will be discussed, followed by an investigation of the threshold minimum for 
the case where a m « R s , corresponding to the detectable moon with the 
smallest radius. 



10.4.2.1 Behaviour of threshold in the case of white noise 



For the case of white photometric noise, a e is given by equation (9.29), 



a f = 47.9s 



3.95 x 10" 4 



At y/ 2 ' 

lmin J 



100(Ap + Ar. 
NL 



( Tobs_\ 3 ^ 2 ( Ttra \ 1 

V24hrs J V 13hrs / 



Noting that N t raLo/(A p + A m ) is approximately equal to R 2 s /Rl and 



substituting this expression into equation (10.30), and simplifying gives 

cr/L 



R m = 2.15 x 10 _o iif 



1 



3.95 x 10" 4 



At 
lmin 



1/2- 



/ Ttra \ 

V 13hrs J 



\ -1/2 



{2R S + 2a m f/ 2 
o m V / 2~Rl 



, (10.31) 



where equations (10.1) and (7.18) were used to substitute for and T tra - 



This equation describes the minimum radius of a moon that can be detected 
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to three sigma significance as a function of the moon's semi-major axis. 
As discussed in section 19.61 for the case where a m <C R s , the R s term 

1 12 

dominates in the (2R S + 2o m ) term and so R m oc 1/am , and thus such 
moons are undetectable. Similarly, for the case where a m S> R s , the a m 
term dominates the (2R S + 2a m ) expression, we have that R m oc Om 4 , and 
again, moons are undetectable. However, for the case where a m ~ R s , there 
is a region for which moon detection is possible. To investigate the shape 
of this region, the minimum of this threshold curve, that is, the semi-major 
axis which gives the smallest value of R m , will be determined. 

To find the minimum of this function, the derivative is taken with respect 
to a m and then 4^- is set equal to zero, giving 

Q= 1 3(2fl 5 + 2a m ) 1 / 2 1 {2R s + 2a m f' 2 
a m \J2R S a m \J2R S 



(2R S + 2a™) 1 / 2 



a 



V2R~s 



[3am ~ (2R S + 2a™)] . (10.33) 



where all the terms which do not depend on a m have been neglected. As a m 
cannot be infinite, we multiply by a m ^2R s , to give 

= (2R S + 2a™) 1 / 2 [a m - 2R S ] . (10.34) 

Noting that the first term cannot equal zero (as only positive semi-major 
axes are physical) the equation can be divided through by it, giving 

= a m - 2R S , (10.35) 



or 



2R S . (10.36) 



Consequently, for the case where the photometric noise is white, we 
expect that moons with a m <C R s will be undetectable, moons with (z m « R, 
will be detectable, with the most detectable moons having semi-major axis 
equal to a stellar diameter and that moons with a m 3> R s will also be 
undetectable. In addition, for the case where the planet's orbit is inclined 
(5mi n ^ 0), it can be shown using an equivalent derivation that for this 
case the most detectable moons have semi-major axis equal to the length 
of the chord they make on the star, that is, a m = 2R S (1 — <5 2 • ) x / 2 . To 



put this result in context, recall from figure 8.8 that At is very accurately 
represented by a sinusoid, for moons with a m ~ 2R S , even for large values 
of v m /vtr- Consequently, this result is very robust. 
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10.4.2.2 Behaviour of threshold in the case of filtered noise 



For the case where the photometric noise is dominated by filtered realistic 



noise, a e is given by equation (|9.36|), 
a e = 53.2s [P] 




8 X 10 ^ (2^) + L0 ° 8 } 



\24hrs J 



Neglecting the constant term in the large round brackets as it is much smaller 
than the other term for all transits of interest gives 



o> « 53.6s I 



W0(A p + A r , 

L^Ntra 



-1 



/ Ttra \ 

Vl3hrs J 



-1 



24hrs 



(10.37) 



Substituting this into equation (10.30) and simplifying gives 



R, 



1.77 x 10~°R 



5d2 P (2R s + 2a n ^ 



N 2R s a„ 



(10.38) 



where we again note that N tra Lo/ {A p + A m ) « R^/Rp and where where 



equations (10.1) and (7.18) have been used to substitute for T t> s and Tt ra 



Again, as for the case of white noise, moons with a m <C R s or a m 3> R s will 

1/2 

be undetectable. In particular, for the case of a m <C R s , R m oc 1/dm , and 

1/2 

for the case of a m 3> R s , R m oc am ■ While very close or very distant moons 
are undetectable, moons with a m « R s may be detectable. To investigate 
this case we again consider the semi-major axis corresponding to the smallest 
detectable moon. 



Taking the derivative of equation (10.38) with respect to a m , setting 
^p 21 = and simplifying gives 







1 4(2R s + 2a r , 



dm 2 R s 
(2R S + 2a m ) 



[4a r , 



1 (2^ + 2a m 
- {2R S + 2a m )} . 



2R s al 

Noting that the first factor again cannot be equal to zero, we have that 



(10.39) 
(10.40) 



= 4a m - (2R S + 2a r , 



(10.41) 
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or 



R.i 



(10.42) 



So, for the case where the light curve is dominated by filtered noise, 
moons with a m <C R s are undetectable, moons with (irn 

« R s are possibly 

detectable, with the most detectable moons having semi-major axis given 
by a m = Rs and finally, moons with a m S> R s are again undetectable. In 
addition, repeating this analysis for the case of inclined orbits again alters 
the semi-major axis corresponding to the minimum. In particular, for this 
case, the minimum occurs for a m = R s (l — i^) 1 / 2 . Again, recall from 



figure 8.8, that At can be well described by a sinusoid for moons with 
a m = R s . Consequently, this result is also relatively robust. 



10.4.2.3 Behaviour of threshold in the case of red noise 

For the case where the photometric noise is dominated by red noise, a e is 



given by equation (9.31), 



a f = 103.7s I 



UN, 



jv ira 



100(A P + A r , 



( T tra \ 

V 13hrs J 



-i 



x f 0.277 (^-) +0.714 f^-Y 



where again the constant term has been neglected. Unlike the case of filtered 
noise, both remaining terms are dominant, and so an analysis of the type 
conducted for the case of white and filtered noise will not be possible. While 
simple analytic expressions for the semi-major axis corresponding to the 
threshold minimum are not available, the behaviour of the minimum can be 
bracketed by investigating the cases of a e oc T^ bs /T tra and a e oc T^ bs /T tra . 
In addition, only the case of a e oc T^ bs /T tra needs to be analysed as the case 
of cr e oc T^ bs /T tra was investigated in the previous section in the context of 
filtered noise. 



Substituting this into equation (10.30) and simplifying gives 



D 2 D 2 P i^Rs + a r, 
K m ^ K s 



N 2R s a r , 



(10.43) 



For the case where a e oc T^ bs /Tt ra , we again have that moons with a m *C R s 

1/2 

or a m S> R s are undetectable as R m oc 1/a™ and R m oc a m respectively. 
However, for a m ~ R s , there is a chance of detection. Again, determining 



246 



the semi- major axis corresponding to the minimum value of R m , by differ- 
entiation gives 



1 6(2R s + 2a m ) 2 1 (2R s + 2 
~~ ~n or ^2 op ' (.1U.44J 

= {2R ; + 2 2 Qm)2 [6a m - (2ii s + 2a™)] , (10.45) 

which gives, 

= 6a m -(2 J R s + 2a m ), (10.46) 

a m = ^R s . (10.47) 

For the case where the light curve is dominated by red photometric 
noise, moons with a m <C R s or a m 3> R s are again undetectable, while 
moons with a m ~ R s may again be detectable. In particular, the semi- 
major axis corresponding to the smallest detectable moon ranges from R s 
to 1/2R S depending on the transit duration, where it tends towards R s 
for short transit durations and 1/2R S for longer transit durations. Finally, 
repeating this analysis for the case of inclined planet orbits, it is found that 
this range changes from one half to a quarter of the diameter of the star to 
one half to a quarter of the length of the chord made by the path of the 
planet across the face of the star. Unlike the case for white and filtered 
noise, this minima may lie in the region where our approximation for At 



or 



is no longer accurate (see figure 8.8(c)). Consequently, for this case, the 



specific behaviour of the threshold will also depend on v m /vt r - 
10.4.3 Summary of expected properties 

Combining the summary of the properties of the non-detection spikes with 
the properties derived for the case where the number of transits becomes 
large, a comprehensive picture of the type of behaviour that we expect the 



detection threshold to show can be formed (see figure 10.3). To begin, we 
expect the thresholds to be in the shape of a distorted "U", such that the 
minimum radius of a detectable moon tends to infinity as the moon semi- 
major axis tends either to or to infinity. In addition, the minima of this 
curve should occur at a m = 2R S (1 — ^m) 1 ^ 2 f° r the case °f white noise, 
a m = R s (l — 5„j„) 1//2 for the case of filtered noise and between R s (l — 
^mm) 1//2 ano - l/2-R s (l — ^min) 1 ^ 2 for the case of red noise, where we note 
that 2R S (1 — 5 min ) l l 2 is the length of the chord that the planet makes across 
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Moon Semi-major Axis 

' ' ' ' 2 1/2 

y 2 R ch R ch ? 2 R ch 2R ch Rch=R s (l-6 rai „) 

Figure 10.3: Cartoon showing the expected features of TTV P moon detection 
thresholds for the case of a light curve contaminated with white (black line), 
filtered (green line) and red (red line) photometric noise. The detection 
threshold estimated by assuming N is large is denoted by a dashed line, 
while the full detection threshold is shown as a thicker solid line. 



the face of its host star. In addition to this general behaviour, the threshold 
should also be decorated with a comb of non-detection spikes with spacing 
proportional to a^ 5 and width proportional to Na^- Now that we have an 
understanding of the shape and behaviour of the TTV P detection threshold, 
we can compare it with similar detection thresholds calculated for the three 
other transiting moon detection techniques. 



10.5 Comparison with literature thresholds 

One of the aims of this analysis is to compare the set of moons that can 
be detected by TTV P with the set of moons that can be detected using 
other transit moon detection methods, namely direct detection, barycentric 
transit timing and transit duration variation. Approximate expressions for 
the three sigma detection thresholds associated with these techniques were 
derived for the case of large N and white photometric noise in section |4.3.2 
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Figure 10.4: Figure of the same form as figure |4.4[ but also showing the 



TTVp threshold described by equation (10.48), plotted in green 
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(equations (4.18), (4.25) and (4.28)) and are restated below for convenience 

1 [ a L /L f At \ V2 / 13hrs\ ^ 
iV 1 / 4 3.95 x 10~ 4 VlminJ \ T tra ) ' 



Rm = 0.0065iL 



Rm = 0.0168i? s 



At 



1/2 



ATl/6 3.95 x 10- 4 Vlmm, 



1/3 



/I3hrs\ 1/6 

\ T tra J 

Pp \ 1/3 f Rp V /2 

0.1R„ ) 



Rr, 



0.0197i? s 



ATl/6 



3.95 x 10" 4 Vlmin J 



1/2 



0"m 

R.i 



1/3 



1/6 



/13hrs\ 1/2 

V Ttra J 



Pp 

Pice 



V6 /_. \ 1/3 
Pice 

Pi, 



and plotted as a function of a m in figure 10.4, which is a recreation of 



figure 4.4 



We now have the tools to derive an equivalent expression for the case of 



the photometric transit timing technique. From equation (10.31), we have 



that the three sigma detection threshold for the case of photometric transit 
timing is given by 



R r , 



0.0066i? s 



1 



ATl/4 



a L /L. 



o 



3.95 x 10 



-i 



At 
lmin 



1/2" 



1/2 



/13hrs\ 1/4 

\ Ttra J 



Ry 



-1/2 



1 + 



3/4 



(10.48) 



where the equation has been recast into physical variables for ease of com- 



parison. This threshold is also shown in green in figure 10.4 



Through a comparison of equations (4.18), (4.25), (4.28) and (10.48), and 



a visual examination of figure 10. 4[ a number of similarities and differences 
between the thresholds corresponding to the four methods can be seen. In 
particular the dependance of the threshold on N and a m . These aspects 
will be discussed, followed by a short discussion on the comparative effect 
of correlated noise on each of these methods. 
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We begin the comparison of the TTV P detection threshold with the three 
thresholds associated with the other methods, by considering the depen- 
dance on AT, the number of transits recorded. Comparing equations (4.18), 
Q4.25[ ), ( |4.28| ) and ( |10.48| ) we see that the TTV P threshold has the same de- 
pendance on A~ as the direct detection threshold as opposed to the 1/iV 1 / 6 
dependance shown by barycentric transit timing and transit duration vari- 
ation (see equations (4.25) and (4.28)). The physical origin of this depen- 
dance is that photometric transit timing and direct detection measure the 
cross-sectional area of the moon (oc i?^J while barycentric transit timing 
and transit duration variation measure its mass (oc R^)- Consequently, as 
a result of this dependance on N, the thresholds for the case of TTV P will 
decrease at the same rate as those for direct detection as the number of 
recorded transits increases, but more rapidly than those for the two other 
timing methods. 

In addition to the dependance on N, the dependance of these thresh- 
olds on moon semi-major axis is also of interest. As can be seen in equa- 
tions (4.18), (4.25) and (4.28), and as was discussed in section 4.3.2 the 
moon detection thresholds for the case of direct detection, barycentric tran- 
sit timing and transit duration variation are independent of a m , decrease 
with increasing a m and increase with increasing a m respectively. Physically 
this means that the barycentric transit timing and transit duration variation 
techniques are optimised to detect long and short period moons respectively, 
while direct detection is equally as good at detecting all moons. In com- 
parison, the TTVp technique is optimised to detect moons with a m ~ 2R S 
(for the case of white noise), and, in addition, cannot be used to detect very 
close or distant moons. As the optimal range probed by the other two timing 
techniques brackets the range of semi-major axis for which this technique 
works, and recalling that the direct detection threshold has not yet been 
extended to deal with red noise, it can be seen that the TTV P technique 
could be a complimentary technique. 

Finally, in addition to having different dependancies on A^ and a m , the 
direct detection, barycentric transit timing, transit duration variation and 
photometric transit timing techniques are also affected to differing extents 
by correlated noise. Prior to this thesis, the only work to investigate the 
effect of correlated noise on moon detection was that of Kipping et al. ([2009 ) . 
They suggested that the transit duration variation technique was relatively 
immune to the effects of correlated noise. In comparison, in chapter [9] of 
this thesis it was found that correlated noise leads to a substantial decrease 
in moon detectability using the the TTV P technique. Consequently, this 
aspect needs to be kept in mind before applying this technique to real data. 
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So, in summary, the direct detection, barycentric transit timing, tran- 
sit duration variation and photometric transit timing techniques all probe 
different portions of parameter space. In particular, the direct detection 
and photometric transit timing thresholds will decrease more rapidly with 
increasing N than those of barycentric transit timing or transit duration 
variation as a result of the different physical quantities that they measure. 
In addition, the TTV P method has a range of semi-major axes [a m ~ 2R S ) 
for which it is optimised to detect moons, as opposed to barycentric transit 
timing and transit duration variation which are optimised to detect long and 
short period moons respectively. Finally, the effect of correlated noise may 
affect these results. For example, as shown in chapter [9] the timing noise on 
r increases dramatically if the light curve is contaminated with correlated 
timing noise. Consequently for the case of a host star with a low number 
of transits and correlated photometric noise, barycentric transit timing and 
transit duration variation, which are less affected by red noise, may be more 
optimal to use, despite their unfavourable dependance on N . 

Now that the TTV P threshold has been discussed in isolation and put 
into the context of the other transit detection methods presented in the 
literature, we are in the position of being able to discuss and understand 
realistic detection thresholds. With this in mind numerically calculated 
detection thresholds for the case where N is small will be discussed in the 
next section. 



10.6 Numerically calculated TTV p moon detection 
thresholds 

Now that we have the tools available to calculate and analyse detection 
thresholds for TTV p , we are in a position to calculate thresholds for the 
more realistic case of a finite number of transits. While we have equa- 



tion (10.19), the expression for the statistic 2 log A, defining the location of 
the thresholds, the process required to calculate the thresholds is not trivial 
and consequently we begin with a brief discussion of this aspect. Then we 
move on to analysing and discussing the calculated thresholds. As discussed 



in section 10.2 we will perform this investigation in three stages. First, we 
will investigate the simplest case, that of a planet on a circular orbit passing 
across the central chord of its host star ( i.e. Srnin — 0)* This case will then 
be used as a comparison case for the investigations into the effects of slight 
inclination of the planetary orbit (5 m i n 7^ 0) and eccentricity in the plan- 
etary orbit on moon detectability. We begin by summarising the method 
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Figure 10.5: Plot of the 99.7% detection threshold for the case where the 
orbit of the planet is circular and aligned to the line-of-sight for the 12 



cases discussed in section 10.2 The predicted maximum mass of a naturally 

formed moon is shown as a horizontal dashed line, while the minimum and 

maximum semi-major axis that a moon undergoing either inward or outward 

orbital evolution would have achieved, assuming a system lifetime of 5Gyr, 

are shown by solid grey lines. For the case where the planet has the mass of 

Jupiter, Uranus or the Earth, the radii and semi-major axes of the satellites 

of Jupiter, Uranus or the Earth are also plotted (grey circles). Finally, the 

interior of the planet and the three-body unstable region are hatched. 
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used for generating thresholds. 



10.6.1 Method for calculating Monte Carlo Thresholds 

For this thesis we define the threshold as the line separating the region 
of parameter space where "most" moons are "detectable" from the region 
of parameter space where "most" moons are not "detectable", where "de- 
tectable" is defined by whether or not the value of 2 log A calculated using 



equation (10.19) lies above a critical value. So, in order to practically cal- 
culate these thresholds, we need a definition of "most" , as well as a way of 
determining if a system is "detectable", which is not too computationally 
expensive. These two choices will be discussed in turn. 

We begin by discussing and defining the term "most" . The selection of 
the definition of "most" is not a trivial issue for two main reasons. First, 
as a result of statistical fluctuations, there is a non-zero probability that 
a moon will be "detected" when there is no moon, for example, for the 
thresholds presented in this chapter this probability is 0.3%, so the limit 



cannot be set too low. In addition, as mentioned in section 8.4.2.3 and in 



the footnote on page 229 moon detectability depends on 0, for example, the 
width of the non-detection spikes is a function of 4>. As a result, selecting a 
definition of "most" which is too high will effectively result in a measure of 
the detectability of the least detectable moons with that radius and semi- 



major axis (see the footnote on page 229 for an extreme example). As 
a compromise between these two effects, it was decided that the point at 
which "most" moons were detected corresponded to the point at which 50% 
of the moons would have been detected. Now that the term "most" has 
been defined, the method used to determine 2 log A, and thus if a moon has 
been "detected" will be briefly outlined. 

Following from the above discussion, the most intuitively obvious ap- 
proach to determining 2 log A, and thus constructing thresholds would be 
to discretize the moon radius semi- major axis plane into a grid of points, 
simulate a statistically representative number of r sequences for each point, 
determine the median 2 log A value for each point and draw the threshold 
where this value is equal to the critical value. However, this approach is 
not feasible as it is computationally expensive, consequently a different ap- 
proach was used. First, three non-physical grids were set up corresponding 
to the cases of N = 9 (a p = 0.6AU), N = 14 (a p = 0.6AU) and N = 40 
(a p = 0.2AU), with the aim of transforming the results to generate the 
thresholds. Instead of discretizing on moon radius, the quantity A/a e was 
used (ranging from to 8) and instead of discretizing on moon semi-major 
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axis, the quantity oj was used (ranging from to 2tt). In addition, for these 
models to and T p were both set to 1 (such that they were the same order of 
magnitude as A /a). For each grid point a statistically significant number of 
realisations of r, defined as 

A e • 

Tj = t + jT p + — cos(wj + 4>) + — , (10.49) 

were calculated. In particular, note that £j/cr e is a normally distributed 
variable with standard deviation 1 and mean 0, and is given by a ran- 
dom variable which is uniformly distributed between and 2tt. For this 
application 51 realisations were used. 13 

For each realisation, the value of 2 log A was calculated and recorded. 
Finally the values were sorted by size and the 26 th (the middle value) was 
selected. This data was then transformed to a grid of 25 by 300 points in 
moon radius and semi- major axis space for realistic values of to and T p using 



the method described in appendix M While this method is advantageous 



in that it dramatically decreases the computation time required to produce 
each plot, it assumes that At is well approximated by a sinusoid 1 for all 
values of moon semi-major axis. Taking this approximation into account 
and continuing, to use this information to calculate the thresholds to a 
given significance, the probability distribution of 2 log A for the case where 
there is no moon is required. 

For the case where the fitting model is linear, or, the number of transits, 
N, is large, the distribution of 2 log A for the case where there is no moon 
would be given by a x 2 distribution with three degrees of freedom. However, 
as mentioned previously, the fitting model is not linear in the parameters u 
and (j). Consequently, to determine the form of the distribution of 2 log A, 
simulations were run for the case of = 9, 14 and 40 for the case where there 
was no moon. For each simulation 10000 realizations of r were generated for 
the case where A = and analysed to determine the shape of the probability 



13 



The behaviour of moon detectability changes as <j> changes by n/2 (see footnote on 



page 



229 for an example). Consequently, in order to categorise the behaviour, the number 



of simulations must be s ubst anti ally larg er than (tt/2)/2tt = 4 



14 



Recall from sections 



8.3 



and 



1.4.1.1 



that Ar is only well-described by a sinusoid for 
the case where v m /v tr is small and where the change in velocity during transit is not 
significant. Consequently, the thresholds calculated using this method may be inaccurate 
for the case where v m /v tr is large or the transit duration is comparable to, or larger than 
the moon's orbital period. To give a feel for where this problem may arise, thresholds 
in the region where v m /v tr is large (v m /vt r > 0.66), or the moon noticeably accelerates 
during transit (T tr a > l/4T m ) a dashed line style as opposed to a solid line style is used. 
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distribution of 2 log A. This process was then repeated six times in order to 
gain an understanding of the variability of the calculated critical values. 

For this work, the 99.7% threshold is used (recall that for a standard 
normal distribution, the probability of being three sigma away from the 
mean is 100% - 99.7% = 0.3%). From these simulations it was found that 
99.7% of simulated r sequences had 2 log A values less that 14.99 ± 0.27, 
15.00 ± 0.26 and 15.33 ± 0.36 for the cases of A/" = 9, 14 and 40 respectively. 
As the confidence limits on these values all overlap it was decided to use 
the value 15 to generate all the 99.7% thresholds shown in this chapter. 
However, while this is the value used for this work, different thresholds can 
be selected (see appendix [N| for the 68.3% and 95.4% thresholds) 

Now that the method for calculating the thresholds has been briefly 
described, the thresholds themselves can be calculated and analysed. We 
begin with the case of circular planet and moon orbits, for the case where 

3min — 0. 



10.6.2 Circular coplanar orbits 



We begin with the simplest configuration of planet and moon orbits, namely 
the case where both the planet and moon have circular orbits with the planet 
and moon transit the central chord of the star. Using the method discussed 
above, the 99.7% detection thresholds were calculated assuming white, re- 
alistic and filtered realistic photometric noise, for the 12 cases discussed in 
section 10.2 (see figure 10.5). These thresholds will be discussed in terms 



the dependance of the threshold on moon semi-major axis, dependance of 
the threshold on planet semi-major axis, and the location of the threshold 
with respect to moon formation and stability limits. 



10.6.2.1 Dependance of threshold on moon semi-major axis 



As discussed in section 10.4, we expect the detection threshold to have a 
lop-sided U-shape with a minimum at a m = 2Rq for the case of white pho- 
tometric noise, a m = Rq for the case of filtered noise, and between 1/2Rq 
and Rq for the case of realistic noise. Noting that Rq = 0.00465^4?7, it 



can be seen from figure 10.5, that for the case where the minimum is well 
described by this analysis i.e. is not dashed, these approximate relations 
accurately describe the shape of the thresholds, especially for the case of 
filtered and solar photometric noise. In addition, we expect that the thresh- 
old is decorated with a set of non-detection spikes with width and spacing 
proportional to a^ 5 . Such spikes can be clearly seen in the calculated thresh- 
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olds for large values of a m , for example for a m larger than 1.5 x 10 _2 AU in 
figure 10.5(c)| For smaller values of a m the regular pattern is broken as a 
result of finite numerical resolution, that is, the pattern is disrupted when 
the distance between neighbouring spikes becomes comparable or smaller 
than the discretisation used to construct the threshold. 



10.6.2.2 Dependance of threshold on planet semi-major axis 



As can be seen from figure 10.5 moons are more detectable around short 
period planets. For example, for the case of filtered noise, the minimum 
radius of a detectable moon is a little over one Earth radius for the case 
of a planet at 0.6AU, compared to a little over half an Earth radius for 
a planet at 0.2AU. This trend is a result of two main factors. First, the 
shorter transit durations, and thus shorter observing durations, associated 
with short period planets result in a decrease in the amplitude of the timing 
noise. This effect is particularly apparent for the case of realistic stellar 
photometric noise as a result of the superlinear relationship between €j and 
T bs (see figure 9.6). Second, planets with smaller semi-major axes undergo 



more transits (Nt ra cx a p 1 ) which counteracts the decrease in amplitude 

l li 

of At (At oc a p ' ) and results in a net increase in moon detectability. 



10.6.2.3 Comparison with formation and stability limits 

In addition to understanding the thresholds in isolation, it is also useful to 
view them in the context of the set of moons which are physically realistic, 
that is, the set of moons which are able to form, and are orbitally stable 
for the lifetime of the system (see chapter [3] for an overview) . In each of 
figures 10.5(a) through to 10.5(1) the set of moons which are likely to form 
and are orbitally stable is indicated in a number of different ways. First, for 
each planet where a Solar System analog exists, the moons of that analog 
are shown as grey circles, for example for the case of a one Jupiter mass 
host, the four Gallilean satellites are shown. Second the maximum moon 



mass limit presented in the literature (see sections 3.3.1 and 3.3.2) is shown 
as a dashed grey line. Third, to indicate the effect of orbital evolution, the 
two filled grey lines show the semi-major axis that the innermost and outer- 
most stable moons would have achieved given a system lifetime of 5Gyr (see 
section 3.4.3). Finally, the region which is three-body unstable is hatched. 



As moon formation and stability processes depend strongly on the host 
planet mass, the cases of the four different host planet masses will be dis- 
cussed in turn. For the case a the ten Jupiter mass host planet, physically 
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realistic moons are detectable the the case of white and filtered noise for 
all planet semi-major axes of investigated. In addition, for the case where 
the planetary semi-major axis is 0.2AU, physically realistic moons could be 
detected even for the case of unfiltered noise. This favourable situation is 
a result of the linear dependance of the maximum moon mass on planetary 



mass predicted by Canup & Ward (2006). For the case of a one Jupiter 



mass host, physically realistic moons are again detectable, but only for the 
case of white photometric noise. Interestingly, by comparing the threshold 
for the case of 0.2AU and 0.4AU, it can be seen that the detection of a 
Ganymede analog may be possible. In comparison, for the case of a one 
Uranus mass host planet, while stable moons are detectable, no moons pre- 



dicted to form according to the Canup & Ward (2006) model are detectable 



at either the 99.7% level or even the 68.3% level (see appendix [NJ . Finally, 
for the case of Earth-like planets, neither stable or physically realistic moons 
are detectable. Consequently, it may be possible to detect moons of large 
gas giants, and unexpectedly large moons of smaller gas giants, but it is not 
possible to detect moons of terrestrial planets as a result of formation and 
stability constraints. 

10.6.3 Slightly inclined orbits 

For the case where the planet's orbit is circular, but slightly inclined (so that 
the planet no longer passes across the central chord of the star) the TTV P 
moon thresholds change. This is because the shorter chord traveled by the 
planet across the face of the star results in a shorter transit and observing 
duration. In particular, instead of being 2R s /vt r , the transit duration is 
now 2R S (1 — Smin) 1 / v tr an d instead of being 2{R S + a m )/vt r the observing 
duration is now 2(i? s (l — S^in) 1 ^ 2 + a m)/^r- To investigate this effect, 



the thresholds in figure |10.5 were recalculated assuming 5 m in = 0.5R S and 



are presented in figure 10.6 The effect of this change will be discussed by 



comparing with the thresholds calculated in the previous section. 



10.6.3.1 Dependance of threshold on moon semi-major axis 

For the case of slightly inclined orbits the dependance of the threshold on 
semi-major axis varies slightly from the dependance for the case of an aligned 



orbit. As discussed in section 10.4, we expect that for the case of a transit 
with 5 m i n = 0.5, the threshold minima would shift from 2Rq, Rq and be- 
tween Rq and 1/2Rq to 1.73-Rq, 0.87i?Q and between 0.87-Rq and 0A3Rq 
for the case of white, filtered and realistic stellar photometric noise respec- 
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Figure 10.6: Figure of the same form as figure |i"CT5" except calculated for 
the case of a slightly inclined (5 m i n = 0.5Rq) planet orbit. As can be seen 
by comparing with figure 10. 5[ the effect of slight inclination in the planet's 
orbit is two-fold. First, the thresholds, and in particular, the threshold 
minima are shifted towards slightly lower values of moon semi-major axis. 
Second, for the case of solar photometric noise, the relationship between 
large planetary semi-major axis and low moon detectability becomes less 
marked (see text). 
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tively. As the minima for the case of white noise are very shallow this 
difference is not obvious. In addition, the minima for the case of filtered 
noise and red noise lie very close to the region where the expression for At 
is inaccurate and inside the region which is the most affected by unresolved 
non-detection spikes, and consequently, this slight change is not immediately 
discernible. To see this change, compare the red and filtered thresholds from 
fig ures|10.5(l)| and |10.6(l)] between a m = 0.002AU and a m = 0.008AU. In fig- 
ure 10.5(1) , the threshold for the aligned orbit, the filtered noise threshold is 



nearly horizontal in this region, while for the case of realistic noise it begins 
horizontal and then attains a slight positive slope. Compare this to the case 
of the threshold for the analogous inclined system shown in figure 10.6(1) In 
this case the threshold for filtered noise and red noise have a positive slope 
throughout this region. This change in gradient indicates that the effect of 
changing 5 m i n from zero to 0.5 is to shift the minimum of the threshold to 
the left, as expected. 



10.6.3.2 Dependance of threshold on planet semi-major axis 



Unsurprisingly, the behaviour of the thresholds for the case of slightly in- 
clined orbits as a function of planetary semi-major axis shows the same 
trend as for the case of aligned orbits. In particular, moons of planets with 
smaller semi-major axes are more detectable than moons of planets with 
larger semi-major axes. However, for the case where the orbit is inclined 
and the noise is red, this progression is not as marked. To see this compare 
the moon radius associated with the red noise threshold minima between 
figures |10.5(j)| and |10.5(1)[ and |10.6(j)| and |10.6(1)| 

To understand the origin of this effect, recall that, inclination of the 



planet's orbit modifies the transit duration, T tra (see equation (7.25)), and 



through that, the observing duration (see equation ( |10.1| )). As the amplitude 
of At depends on (T i, s - T tra ), a quantity does not depend on 5 m i n , this 
change in the position of the threshold is due to the dependance of a e on Tt ra 
and T f, s . In particular, recall from equation (9.13) that a e is proportional 



to <TQT ), S /Ttra- As a result of the super- linear dependance of oq on T t> s for 
the case of red noise, for the case where T f, s is large, small decrease in the 
size of Tobs causes a large decrease in the size of <tq , while for the case where 
T i, s is small, a small decrease in the size of T f> s causes a small decrease in 
the size of ctq. This difference means that for large semi-major axes (large 
T b s ) the relative change in cjq can be larger than the relative change in 
T^a, an d thus the amplitude of a t reduced, while for smaller semi-major 
axes (small T b s ) the relative change in cjq can be smaller than the relative 
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change in Tt ra , and thus, the amplitude of a t increased. This increase in 
the size of a e for large planet semi- major axes and decrease in the size of 
cr e for small planet semi-major axes is the origin of the slight change in the 
dependance of moon detectability on planet semi-major axis for the case 
where the planet's orbit is slightly inclined. 



10.6.3.3 Comparison with formation and stability limits 

As slight inclination changes the position of both the threshold and the re- 
gion of parameter space which is three-body stable only slightly, the overlap 
between the set of detectable moons and the set of moons which can form 
and be stable for the lifetime of the system is effectively equivalent to the 
overlap for the aligned case. Now that the effect of inclination has been 
discussed, we can move onto a discussion of the effect of eccentricity of the 
planet's orbit on the TTV P thresholds and thus the set of realistic moons 
likely to be detected. 



10.6.4 Eccentric orbits 

For the case where the orbit of the planet is eccentric, the transit light 
curve is either stretched or contracted as a result of the different value of 
vtr, the velocity of the planet in the plane of the sky during transit. In 
particular, there are two factors which could alter vt r and consequently affect 
the detection thresholds, the eccentricity of the orbit, and the orientation 
of the orbit. To explore these two effects, two cases were analysed. First 
the case of a planet with orbital eccentricity 0.1 which transits at pericenter 



is examined (see figure 10.7). Second the case of a planet with orbital 



eccentricity 0.1 which transits at apocenter is examined (see figure 10.8). 
This particular eccentricity were selected as first, 0.1 is a representative 
value of eccentricity for extrasolar planets, second, as the size of the stable 
region decreases rapidly with increasing eccentricity, and third as it would be 
of interest to relate this work to the planet CoRoT-9 b as it has an orbital 
eccentricity of 0.11 and is capable of hosting large tidally stable moons 



(Weidner & Home 2010). As pericenter and apocenter are the positions 
on an eccentric orbit where vt r is maximised and minimised respectively, 
these two cases should bracket the behaviour of the thresholds for any other 
orientation. These two cases are compared to the results for circular orbits, 
and then will be discussed in terms of the planet CoRoT-9 b. 
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Figure 10.7: Figure of the same form as figure 10.5 except calculated for 
the case of an eccentric (e p = 0.1) planet orbit, oriented such that the 



transit occurs at periastron. As can be seen by comparing with figure 10.5 
the effect of eccentricity on the thresholds for this orbital orientation is to 
shift the white and solar noise thresholds vertically upward and downward 
respectively. As a result, the three thresholds are closer to each other than 
for the circular case. In addition, eccentricity reduces the size of the three- 
body stable region (hatched), which in turn reduces the set of moons that 



are tidally stable for the lifetime of the system (compare figures 10.5(d) and 
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Figure 10.8: Figure of the same form as figure 10.5 except calculated for 
the case of an eccentric (e p = 0.1) planet orbit, oriented such that the 



transit occurs at apastron. As can be seen by comparing with figure 10.5 
the effect of eccentricity on the thresholds for this orbital orientation is to 
shift the white and solar noise thresholds vertically downward and upward 
respectively. As a result, the three thresholds are more spread out than for 
the circular case. In addition, eccentricity reduces the size of the three-body 
stable region (hatched), which in turn reduces the set of moons that are 
tidally stable for the lifetime of the system (compare figures |10.5(d)| and 
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10.6.4.1 Dependance of threshold on moon semi-major axis 



As can be seen in figures 10.7 and 10.8, the threshold curves for the case 



of eccentric planet orbits have the same general shape as those for the case 
of circular orbits, in particular, have minima at a m = 2Rq, a m = Rq, and 
between \/2Rq and Rq for the case of white photometric noise, filtered pho- 
tometric noise and realistic photometric noise respectively. This is exactly 
what we would expect from the analysis in section 10.4 However, while the 



thresholds have the same general shape they are vertically displaced from 
the equivalent aligned thresholds, with the degree of vertical displacement 
depending on the type of photometric noise and the orientation of the orbit. 
As the amplitude of the photometric noise depends strongly on T t, s , which, 
in turn, depends on the semi-major axis of the planet's orbit, this effect will 
be discussed in greater detail in the following section. 



10.6.4.2 Dependance of threshold on planet semi-major axis 

This scaling of the transit light curve caused by the modified value of vt r 
has different affects for the different types of noise. Consider the case at 
pericenter, where vt r is large. In particular, T tra and T b s are 82% of their 
original length. As signal is proportional to (Ttra — T a b s ), the thresholds will 
increase for the case of white noise (a e oc T^), remain steady for the case 
of filtered noise (a t oc T b s ) and decrease for the case of red noise (<r e is a 
superlinear function of T Q b s ). Conversely, for the case of transit at apocenter 
where vt r is small and Tf ra and T Q b s are 122% of their original length. Now 
the thresholds will decrease for the case of white noise, remain steady for 
the case of filtered noise and increase for the case of red noise. These effects 



can be seen in figures 10.7 and 10.8 



10.6.4.3 Comparison with formation and stability limits 

In addition to the effect that planetary orbital eccentricity has on the shape 
of the detection threshold, it also reduces the size of the stability region 



(compare the size of the hatched regions in figures 10.5 and 10.7), which in 
turn alters moon's longevity and thus the set of physically realistic moons 
each planet could have. Again, the comparison between the thresholds and 
the set of realistic moons will be conducted for each of the four different 
mass planets individually. 

For the case of a ten Jupiter mass host planet, physically realistic moons 
can again be detected for all investigated values of the planet semi-major axis 
for both white and filtered noise. In addition, physically realistic moons can 
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also be detected for the case of white photometric noise and a Jupiter mass 
host. For the case of a one Uranus mass host and a one Earth mass host, 
it becomes even more difficult to detect moons as a result of the smaller 
stability region. Consequently, for the case of planets in eccentric orbits, 
detection of physically realistic moons is only possible for larger, Jupiter- 
sized, planets. 



10.6.4.4 Comparison with the case of CoRoT-9 b 



This analysis is also of interest as a result of the similarities between the sim- 
ulated cases and the detected transiting planet CoRoT-9 b. This 0.84 Jupiter 
mass planet is on an orbit with semi-major axis 0.407AU and eccentricity 



0.11 around a 0.99 solar mass star (Deeg et al. 2010). Consequently, as it 



transits near periastron, it is bracketed by the cases shown in figures 10.7(e 



and 10.7(h) 



Unfortunately, these thresholds as they stand cannot be used to de- 
scribe moon thresholds for this planet for two reasons. First, to construct 
these thresholds it was assumed that the semi-major axis of the moon was 
known prior to detection. Consequently, to create realistic moon detection 
thresholds, this assumption would have to be relaxed. Second, the measured 
photometric error is approximately 4 times as large as that assumed to make 
these plots as first, the instrument used is CoRoT, not Kepler, and second, 
as the host star is a 13.7 th magnitude star, not a 12 th magnitude star. 



However, while the thresholds shown in figure 10.7 cannot be directly 
used to constrain moons around this planet they do indicate that such con- 
straints would be scientifically interesting. For example, while it is likely 
that a TTVp threshold could be used to place limits on moons which could 
be orbitally stable for the lifetime of this system, it is unlikely that it could 
be used to place limits on moons which we think should form, as first, the 



threshold and the dashed Canup & Ward (2006) limit only barely inter 



sect for the one Jupiter mass case analysed, and CoRoT-9 b is 0.84 Jupiter 
masses, and second, as the photometric noise is four times larger. Conse- 
quently, a moon detection would be interesting as it would indicate that 



the formation model of Canup & Ward (2006) is not correct, while a non- 



detection would also be scientifically interesting as it would increase the 



number of planets which follow the Canup & Ward ( 2006 ) criterion from 4 
to 5. 
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10.7 Conclusion 



Detection thresholds were constructed for a range of realistic planet and 
moon parameters. Informed by the results presented in chapters [8] and [9] 
on the properties of At and a e the method of generalised likelihood ratio 
testing was used to derive an expression defining the position of the detection 
thresholds. These thresholds were then investigated analytically for the case 
of a large number of transits and numerically for the case of a smaller, more 
realistic number of transits. It was found that the TTV P moon detection 
threshold is given by a skewed U-shaped curve, superimposed with a comb of 
non-detection spikes, and that moons are more detectable for planets which 
are closer to their host star. The minima of these curves is defined by the 
type of noise contaminating the transit light curves and the inclination of 
the orbit. The depth of these curves is determined by the type of noise and 
is modified by the inclination, and the orbital eccentricity and orientation, 
with the sign of the modification depending on the orientation of the orbit 
and the type of photometric nose. In addition, it may be possible to test 
moon formation theories with this method for the case of CoRoT-9 b. 

Now that the second of the two moon detection methods investigated in 
this thesis has been analysed, we will summarise the work conducted and 
indicate directions for future research. 
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Part IV 

Conclusion 
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Chapter 11 

Summary and future 
research directions 



Over the course of this thesis, the question of the detectability of extra- 
solar moons has been addressed in various ways. First, the pertinent issues 
were introduced by considering the types of moons likely to form and be 
retained around extra-solar planets (see chapter [3]) and the suite of methods 
presented in the literature for detecting them (see chapter [4]). Then, two 
of these methods, pulse time-of-arrival perturbation, for the case of moons 
of pulsar planets (see chapters [5] and [6]) and photometric transit timing, 
for the case of moons of transiting planets (see chapters [7J |8j [9] and 10 ) 
were investigated in detail, in turn. As these two sections of work were 
quite independent, the results and future research directions for each will be 
discussed in turn. 



11.1 Moon detection by pulse time-of-arrival per- 
turbation 



The detectability of moons of pulsar planets was first considered using a 
simple model. In particular, it was assumed that the orbit of the planet and 
moon were both circular and in the same plane. This model was then applied 
to the case of the pulsar planet PSR B1620-26 b and used to exclude the 
existence of moons with mass greater than 0.125 Jupiter masses, a distance 
of greater than 0.46AU away from the planet. This was the first published 
limit on the mass and orbital characteristics of a moon of a pulsar planet 
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and, to my knowledge, the third 1 published limit on the moon of an extra- 
solar planet. 

This analysis was then extended by considering the case where the or- 
bits of the planet and moon were no longer circular and in the same plane. 
In particular, the effects of mutual inclination and eccentricity in both the 
planet's and moon's orbits on the time-of-arrival perturbation were investi- 
gated by deriving expressions correct to first order in the sine of the mutual 
inclination, correct for all values of mutual inclination, correct to first order 
in the eccentricity of the moon's orbit and correct to first order in the ec- 
centricity of the planet's orbit. The results of this analysis are summarised 



in figure 6.8 



Both of these investigations constitute a preliminary analysis into this 
problem and could be built on in a variety of ways. In particular: 

• The moon threshold for the case of PSR B1620-26 b was derived as- 
suming that the timing noise was white, with standard deviation given 
by a weighted average of the values presented in the literature. How- 
ever, the timing noise of pulsars is not necessarily white, in particular, 
it may be red, that is, have an over abundance of low frequency com- 
ponents. As discussed in a different context in chapter [9j red noise can 
affect detection thresholds. Consequently thresholds derived directly 
from the observations would be much more realistic. 

• The expressions for the timing perturbation could be calculated using 
an inertial coordinate system. The expressions for the timing pertur- 
bation presented in chapters [5] and [6] were calculated using a coordinate 
system which was tied to the planet's orbit. This assumption is not 
necessarily realistic as the orbits of the planet and moon may evolve as 
a result of external perturbations e.g. for the case of the PSR B1620- 
26 system, perturbation from the white dwarf. As the timescale over 
which data has been taken is so long, over 20 years, this may be an 
issue. The way to solve this is to use an inertial coordinate system 



and to rotate both the planet and moon's orbits using equation (6.7). 
While this approach would introduce more terms into the expansion, 
it would be more robust. 

The moon detection thresholds calculated for PSR B1620-26 b could be 
extended to include the case of mutually inclined and eccentric orbits. 
Using the methods presented in chapter [6j and using the expressions 



1 The first and second published limits were for the planets HD 209458 ffirown et al. 



2001 1 and OGLE-TR-113b ( Gillon et al. 2006 1. Both these planets are transiting planets 
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given as test cases, it would be relative straightforward to implement 
the expansions presented in section [672] in a computer program which 
produces the time of arrival perturbation as a function of the orbital 
elements of the planet and the moon. Combining these results with the 



stability method presented in Mardling (2008b), would result in a sub 



stantial improvement on the work conducted and thresholds presented 
in chapter [5] 

• Finally, the PSR B1620-26 system contains a pulsar, white dwarf and 
planet. Consequently, as mentioned in the paper, if the planet had a 
moon, the system would be a 4-body system. Consequently, to provide 
more realistic constraints for this system, the full 4-body expansion 
would need to be used. 

Now that the results and possible future research directions have been sum- 
marised for the case of this technique, the main results and and possible 
future research directions of the photometric transit timing technique will 
be discussed. 



11.2 Moon detection by photometric transit tim- 
ing 

This thesis also addressed the detection of moons of transiting planets 
through an in-depth look at the photometric transit timing technique. Ini- 



tially proposed by Szabo et al. (2006), this technique has only been investi- 



gated in two works, Szabo et al. (2006) and Simon et al. (2007). In Szabo 



et al. (2006), a preliminary investigation was conducted using a Monte Carlo 



simulation with 500 realisations to see if moons could be detected. This was 



followed up with the work of Simon et al. (2007) who presented a formula 



for the amplitude of this perturbation. In this context, the work presented 
in this thesis had five main objectives, which will be discussed individually. 



First, in this work some of the assumptions made by Szabo et al. (2006) 



were relaxed. In particular, the assumption that the position of the moon's 
transit was known prior to detection was relaxed to give the assumption 
that the quantity, a m , was known prior to detection. This is a substantial 
improvement as first, it reduces the number of variables which must be 



known a priori from three (a m , T p /T m and / m (0) + w m 
second, be estimated from formation models. 



to one (a r . 



and, 



Second, the definition of r given by Szabo et al. (2006) was expanded to 



give explicit expressions for the perturbation due to the moon (named At) 
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and the perturbation due to the photometric noise (named ej) which are 
accurate for all transiting systems likely to be detected (see appendix [j]) . 
Using this approach the At signal and the noise on that signal could be 
investigated separately. 

Third, explicit expressions for the form of At were derived for the case 
where the moon's orbit was circular and aligned with the planet's orbit, and 
where the planet's orbit was circular and aligned to the line-of-sight, circular 
and inclined with respect to the line-of-sight and eccentric and aligned to 
the line-of-sight. In addition, the case where the moon's orbit was slightly 
eccentric and the planet's orbit was circular and aligned to the line-of-sight 
was also investigated in appendix [IJ In particular, these expressions were 
derived for the case where the motion of the moon during transit was ap- 
proximately uniform, and the velocity of the moon around its orbit was 
much smaller than the velocity at which the planet-moon pair transited the 
star. By comparison with At values calculated by simulating the full light 
curve, it was found that these expressions described most detectable moons 
for low mass (M p < lMj) extra-solar planets. It was found that for the 
case where the moon's orbital velocity was small with respect to the transit 



velocity, At was given by a sinusoid with coefficients tabulated in table 8.4 



The addition of eccentricity into the moon's orbit or a high value of the 
moon's orbital velocity resulted in the addition of higher order harmonics 
to At. These results represent a substantial improvement over the ampli- 
tude measure of Simon et al. (2007) as they describe the behaviour of At 
for a planet-moon pair at any position about on their mutual orbit, and as 
they were derived analytically, the regions for which they fail can and were 
documented. 

Fourth, the error in At due to photometric noise was investigated. To 
begin, a method was presented which allows the probability distribution of €j 
to be determined for any variety of photometric noise given that a sufficiently 
long sample of light curve containing this noise (e.g. out of transit light 
curves) is available beforehand. Then, the case of white photometric noise 
was investigated analytically, followed by an investigation using this method 
of the effect of solar and filtered solar photometric noise. For each of these 
three cases it was found that tj was Gaussian and uncorrelated for the case 
of planets capable of hosting substantial moons. In addition expressions for 
the standard deviation of €j were also derived. For the case of white noise 
the derivation of an expression for a e was of particular interest as it allowed 
direct comparison with the qualitative results of Szabo et al. (2006), who also 
assumed white noise. In particular Szabo et al. (2006) found a relationship 
between decreased exposure time and increased moon detectability. In this 
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work, using the formula for a e it was shown that this result depends strongly 
on the origin of the photometric noise, for example, if the photometric noise 
is shot noise dominated, a t is independent of exposure time, if it is read noise 
dominated, then moon detectability increases with increasing exposure time. 
Also, it was found that realistic solar noise resulted in a dramatic increase in 
the timing error and that while filtering reduced this, it did not completely 
reverse this effect. 

Finally, the expressions derived for the form of At and the behaviour of 
€j were combined using the technique of generalised likelihood ratio testing 
to give expressions for moon detection thresholds and to calculate moon 
detection thresholds for a number of characteristic cases. In particular, 
analytic expressions were derived for the case where the number of transits 
tended to infinity. For this case it was found that the TTV P technique 
was most sensitive to moons located at a m = 2R S , a m = R s and between 
a m = R s and a m = 1/2R S for the case of white, filtered and solar noise 
respectively. In addition, as a result of an analysis of the functional form 
of r, it was found that the threshold should be decorated with a comb of 
non-detection spikes with spacing dependent on a m and width dependent 
on f m (to), a m and N. Both these results are in stark contrast with the 



result of Szabo et al. ( 2006 ) who qualitatively found that moons with larger 
semi-major axes were more detectable. These results were then used to 
analyse a Monte Carlo simulation investigating the detectability of moons 
for small N. It was found that for the test case selected, a 12 th magnitude 
Sun-like host star detected by Kepler, that physically realistic moons could 
be detected for the case of large gas giant (M p > lMj) host planets for 
the case where the light curve is dominated by white noise and very large 
gas giant planets (M p ~ 10Mj) where it is dominated 2 by filtered intrinsic 
photometric noise. In addition, it is suggested that that following up planets 
such as CoRoT-9 b using this technique would be of scientific interest as it 
could place constraints on the moon formation model of |Canup &: Ward 



(2006) 



While these results have substantially built and extended on those pre- 
sented in the literature, there are many ways in which they could be im- 
proved, in particular: 

• The assumption that a m is known a priori could be relaxed. As this 
method currently stands, in order to practically search for moons using 



2 Recall from chapter [To] that for the light curve to be dominated by filtered noise, 
the amplitude of the intrinsic photometric variability of the host star would have to be 
approximately twice that of the Sun. 
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At must fee 
simulated 




Moon Semi-Major Axis 



Figure 11.1: Cartoon of a sample TTV P detection threshold showing the 
region where the threshold is accurate (green), and the two regions where it 
may not be accurate (yellow and red). 



it, a semi-major axis would have to be selected (informed by the sen- 
sitivity of the method and our understanding of moon formation), e.g. 
a m = R s for filtered noise, and that window using to search for moons 
with a m < R s . This is not an optimal approach, and the ability to 
search for the correct sized window would be a distinct improvement. 

Extend the method used in chapter [8] to describe inclined moon orbits, 
in particular, investigate the effect of allowing A p and A m to vary from 
transit to transit as a result of the different chords taken across the 
star by the planet and moon. 

Investigate the effect of correlated noise from other sources on the 
TTVp moon detection threshold. An example of a noise source of 
interest would be that of the CoRoT-9 b host star. 

Investigate the behaviour of the non-detection spikes associated with 
(p. While these were only mentioned in this thesis, it would be useful 
to investigate them using an approach similar to the one used in sec- 



tion 8.4.2.3 This would be of use to first, get a handle on the number 
of undetectable systems as a function of cj> and N, and second, to help 
explain some of the fine structure seen in the detection thresholds (for 
example the wiggles in gaps between the non-detection spikes on the 



right hand side of figure 10.5(i) ). 
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• The TTVp detection thresholds could be investigated for the case 
where the the second harmonic of Ar is important, for example, as 
the orbital velocity of the moon is not small or where e m 7^ 0. For 
the case where additional harmonic is due to the large value of v m , 
this extension results in two main challenges. First, the v m /vt r term 
in the expressions for a t would have to be taken into account. While 
generalised likelihood ratio testing can be used to investigate signals 
with standard deviations which depend on the signal, it is not a trivial 
extension. Second, As the amplitude of the second harmonic of Ar 



depends on a m (see equation (8.73)), the transformation method pre 



sented in appendix [M] can no longer be used and a simulation will have 
to be run for each semi-major axis of interest. However, as Ar can 
be accurately estimated as a function of cj>, it should not be computa- 
tionally challenging to determining accurate thresholds in this region. 



This region is shaded yellow in figure 11.1 



The TTVp detection thresholds could be investigated for the case 
where the expressions derived in chapter [8] fail completely (the moon's 
transit is asymmetric or moon's orbital velocity is of the same order 
of magnitude or larger than the transit velocity). As Ar must now 
be numerically simulated for each realisation of (j), a m and R m , this is 
a very computationally intensive operation. The computational load 
could be reduced by using information on the behaviour of the <f> non- 
detection spikes to select a much smaller set of representative values 
of (j) (5 as opposed to 50), simulating the light curves for these values 
and then investigating 10 realisations for each </> value. While this is 
not a trivial task it has the advantage that it will allow construction of 
realistic thresholds for the case of large (M p ~ 10Mj) planets, which 
are the planets likely to have the largest detectable moons and thus 
opening up a scientifically interesting portion of parameter space. This 



region is shaded red in figure 11.1 



Extend the analysis to include additional significance levels for the 
detection thresholds. One way that this could be done would be by 
running additional Monte Carlo simulations to determine the form 
of the null distribution of 2 log A. To give a context, recall from sec- 
tion 10.6.1 that 60,000 simulations were required to calculate the 99.7% 
confidence limit (3 sigma threshold) for the null distribution. Using 
this method, approximately 2,500,000 simulations would have to be 
run to obtain the 99.99994% confidence limit (4 sigma threshold) with 
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comparable accuracy. Another approach would be to assume that N 
is large and approximate the null distribution with a x 2 distribution 
with three degrees of freedom. 

11.3 Conclusion 

Over the course of this thesis the detectability of moons of extra-solar planets 
has been investigated with particular reference to the pulse time-of-arrival 
perturbation and photometric transit timing techniques. Approximate ex- 
pressions for the moon detection thresholds were derived analytically for 
both cases and compared with moon formation and stability predictions. 
This culminated in a limit being placed on the possible moons of PSR B1620- 
26 b for the case of the pulse time-of-arrival perturbation technique, and 
a sequence of generic thresholds being calculated for moons of transiting 
planets for the case of the photometric transit timing technique. While this 
represents a substantial advance in our understanding of the detectability 
of moons of extra-solar planets, there is still much more to be done! 
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Appendix A 

Definition of variables 



Variable Name Definition 



a m Semi-major axis of the moon's orbit. 

a p Semi-major axis of the planet's orbit. 

A Amplitude of At. 

A Most likely value of A, assuming that there is a moon. 

A\ Relative area of the 1 st sunspot. 

A2 Relative area of the 2 nd sunspot. 

A3 Relative area of the 3 rd sunspot. 

A m Sum of a m over a given transit. 

A m The value of A m for the case where the moon travels 

with uniform velocity vt r across the face of the star 
during transit. 

A p Sum of q p over a given transit. 

A p The value of A v for the case where the planet travels 

with uniform velocity vt r across the face of the star 
during transit. 

a(t) Total photometric deficit during a transit. 

a m (t) Photometric deficit resulting from the transit of the 

moon. 

a n (t) Photometric deficit due to photometric noise. 

a p (t) Photometric deficit resulting from the transit of the 

planet. 
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Variable representing v m /vt r , —v m /vt r , —v p /vt r or 
Vp/vtr depending on whether the equation describes 
the ingress of the moon's transit, the egress of the 
moon's transit, the ingress of the planet's transit or 
the egress of the planet's transit. 
Factor by which the photometric noise of the Sun is 
scaled. In chapter 10, /? is set to 1.9. 
The speed of light. 



A Wigner D-function. See equation (6.8) for the defi- 
nition. 

Projected distance between the center of the star and 
the moon. 

Minimum value of S m (t) or 5 p (t). 

Projected distance between the center of the star and 

the planet. 

Eccentricity of the moon's orbit. 

Eccentricity of the planet's orbit. 

Eccentric anomaly of the moon. See equation (8.14) 

for the definition. 

Eccentric anomaly of the planet. See equation (8.13) 
for the definition. 

TTVp timing perturbation due to photometric noise. 
The error in r* due to photometric noise. 
The error in r* due to solar photometric noise. 
True anomaly of the moon's orbit. 
True anomaly of the planet's orbit. 
Function describing the effect of orbital eccentric- 
ity and orientation on transit duration (see equa- 
tion Q7.40D ). 

Moon eccentricity function. See appendix [d] for more 

information. 

Phase of At. 

Variable representing the position of the moon during 
the j th transit. The exact definitions are given in ta- 
ble 8.1 or |I.1| depending on whether the moon's orbit 
is circular or eccentric. 

Most likely value of 4>, assuming that there is a moon. 
The universal gravitational constant. 
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llmm'{Im) Inclination functions. See appendix [c] for more infor- 

mation. 

I m Inclination of the moon's orbit. 

I p Inclination of the planet's orbit. 

j Transit number. 



Jk(x) A Bessel function of the first kind. See equation (8.56) 

for the definition. 
Jv2 P Love number of the planet. 

I Summation index. 

L(t) Luminosity of a star. 

L* (t) Luminosity of a star where the amplitude of the pho- 

tometric noise has been increased by a factor of f3. 
Lq Average luminosity of a star. 

Lfu(t) Best fit curve to L. 

L n {t) Zero mean photometric noise of a star. 

L r Background intensity due to the solar network. 

Ai Longitude of the 1 st sunspot. 

A2 Longitude of the 2 nd sunspot. 

A3 Longitude of the 3 rd sunspot. 

A The ratio of the probability that data was produced 

under the null hypothesis to the probability that it 
was produced under the alternative hypothesis. 

m Summation index. 

m! Summation index. 

M m Mass of the moon. 

M m (t) Mean anomaly of the moon's orbit. Note that the time 

dependance differentiates this from M m , the mass of 
the moon. 

M p Mass of the planet. 

M p (t) Mean anomaly of the planet's orbit. Note that the 

time dependance differentiates this from M p , the mass 
of the planet. 

M s Mass of the star. 

Mj Mass of Jupiter, 1.90 x 10 27 kg. 

Mu Mass of Uranus, 8.68 x 10 25 kg. 

M e Mass of the Earth, 5.97 x 10 24 kg. 

M Q Mass of the Sun, 1.99 x 10 30 kg. 

fj, e The mean of the distribution of €j. This is equal to 

zero by definition. 
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n Summation index. 

n m Mean motion of the planet and moon about their com- 

mon barycenter. 

n p Mean motion of the planet-moon system around the 

host star. 

n A unit vector directed along the line-of-sight. 

n m A unit vector normal to the plane of the moon's orbit. 

N Number of recorded transits. 

N i, s Number of exposures taken during the observing win- 

dow. 

Ntra Number of exposures taken during the transit. 

Pj m (cos#) Associated Legendre polynomial of degree I and order 

m. Defined in equation (6.4). 
p m Density of the moon. 

p p Density of the planet. 

Q p Q- value of the planet. 

0\ Latitude of the 1 st sunspot. 

02 Latitude of the 2 nd sunspot. 

#3 Latitude of the 3 rd sunspot. 

@eg,m ^mXeg^m ~i~ fmip) ~\~ v/2. 

@eg,p Tl"rnbeg,p /m(0) 7T/2. 

@in,m f^mtin,m ~\~ /m(0) ~i~ 7r/2. 

@in,p ^mtin^p ~\~ /m(0) ~\~ ~\~ Vr/2. 

m Spherical polar angle describing the angular orienta- 

tion of r m (see figure 6.1 ). 

P Spherical polar angle describing the angular orienta- 

tion of Y p (see figure 6.1 ). 

r Jacobian coordinate defined in figure 2.3 

r m Jacobian coordinate directed from the planet to the 

moon. See figure 2.4 

r p Jacobian coordinate directed from the planet-moon 

barycenter to the star. See figure 2.4 

R c The centrifugal radius for the accretion disk around 

a planet. For this thesis it is taken to be equal to 
R H /48. 

R m Radius of the moon. 

R p Radius of the planet. 

R s Radius of the star. 

Rh The Hill radius of the planet. Defined in equa- 

tion Q3.1ip. 
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The Roche tidal radius of the planet. Defined in equa- 
tion Q3.10D . 

Radius of the Sun, 6.96 x 10 8 m. 



2.3 



Jacobian coordinate defined in figure 
The vector from the system barycenter to the star. 
The disturbing function. See equation (2.16). 
Moon eccentricity function. See appendix |d| for more 
information. 

The standard deviation of white photometric noise. 
The standard deviation of ej. 
The standard deviation of £q. 
Time. 

Mid-time of the zeroth transit assuming that there is 
no moon. 

Most likely value of to, assuming that there is no moon. 
Most likely value of to, assuming that there is a moon. 
Time of moon egress, that is, the time at which the 
center of the moon passes off the limb of the star. 
Time of planetary egress, that is, the time at which 
the center of the planet passes off the limb of the star. 
Time of moon ingress, that is, the time at which the 
center of the moon passes onto the limb of the star. 
Time of planetary ingress, that is, the time at which 
the center of the planet passes onto the limb of the 
star. 

Mid-time of the planetary transit for the case where 
the planet has no moon. 

Mid-time of the moon's transit for the case where the 
planet has a moon. 

Mid-time of the planetary transit for the case where 
the planet has a moon. 

The duration of ingress for the case of a planet with 
no moon. 

The duration of ingress for the moon for the case of a 
planet with a moon. 

The duration of ingress for the planet for the case of 
a planet with a moon. 
Orbital period of moon. 
Orbital period of planet. 
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Tp Most likely value of T p , assuming that there is no 



p ±»iUOU iiXVGiJ. VC^LIC ui j-p, 

moon. 

Tp Most likely value of T p , assuming that there is a moon. 

Tf ra The transit duration of a planet with no moon. 

Tt ra ,m The transit duration of the moon for the case of a 

planet with a moon. 
Ttrap The transit duration of the planet for the case of a 

planet with a moon. 
TOAp er t, p {t) Time-of-arrival perturbation due to orbit of the the 

planet-moon system about the pulsar. 
TOAp er t )Pm {t) Time-of-arrival perturbation due to planet-moon bi- 

narity. 

At Time between consecutive exposures. 

At p Time delay in the center of the planetary transit due 

to the motion of the planet about the planet-moon 

barycenter. 

r m First moment of a m over a given transit. 

Tj The r value calculated for the j th transit. See equa- 



tion (7.46) for the definition of r. 



First moment of a p over a given transit. 



r* Defined in equation (9.4). 

At(j') TTVp timing perturbation due to a moon. 

v m Velocity of the moon about the planet-moon barycen- 

ter. 

v p Velocity of the planet about the planet-moon barycen- 

ter. 

vtr Velocity at which the planet-moon barycenter transits 

the face of a star. 
00 Angular frequency of At. 

Cj Most likely value of to, assuming that there is a moon. 

oj m Argument of pericenter of the moon's orbit. 

ojp Argument of pericenter of the planet's orbit. 

Q Angular velocity associated with the rotation of the 

Sun. 

f2 m Longitude of the ascending node of the moon's orbit. 

ftp Longitude of the ascending node of the planet's orbit. 

x m (t) x-coordinate of moon. 

x p (t) x-coordinate of planet. 

y m (t) y-coordinate of moon. 
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Vp(t) 
*im(M) 

1pm 



y-coordinate of planet. 

UJp + Op. 

Spherical harmonic of degree I and order m. Defined 



in equation (6.3 ). 



Spherical polar angle describe the angular orientation 
of r m (see figure 6.1). 



Spherical polar angle describe the angular orientation 
of r p (see figure 6.1). 
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Appendix B 

Important equations 



B.l Detecting moons of pulsar planets 

Definition of TOA per t )Pm '- 



1 1 



t ft' 



dn 



TOA pert , pm = -- c — J o ^ —nOfdL 
Expansion of the disturbing function using spherical harmonics: 

oo I 



(6.14) 



K 



GM m M„AL 



M m + 



1=2 m=-l 



Expressions for TOA pertjPm are presented in equations (6.29), (6.41), (6.51) 



and (6.69) for the cases of circular coplanar orbits, circular mutually inclined 



orbits, slightly eccentric moon orbits, and slightly eccentric planet orbits 
respectively. 



B.2 Detecting moons of transiting planets 

Definition of r: 



Definition of At: 



Tj = to+jT p + At + ej. 



jT p + t + At 



A p T p + A m T n 



A p + A m 



(7.46) 



(7.47) 



(7.59) 
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Approximate expression for At for the case where the moon's orbit is cir- 
cular and coplanar with the planet's orbit: 



At = A cos(uj + 



(B.l) 



where expressions for the coefficients A, uj and <fi are given in table |8.4| 
Definition of e,-: 



Ar) -\~ A 



^— [U - (jT p + t + At)} a n {U). 



(7.60) 



Expressions for standard deviation of €j for the cases of white, realistic and 
filtered realistic noise: 



a f = 47.9s 



a/L 



3.95 x 10" 4 7 V 1 



At 



Lmm 



1/2- 



100(A p + A m ) 
NL 

(DabsV' 2 ( Amy' 

V24hrsy V 13hrs / 



, (9.29) 



a f = 103.7s 



L n N-, 



O^tra 



W0(A P + A r , 



(DtraX 

I 13/ir J 



(2 
0.010 (^-) + 0.277 + 0.714 f^-Y 

V 24hrs J V 24hrs / V 24hrs / 



, (plb 



a e = 53.2s I 



100(j4 p + An 



i -i 



( Dtra 

Vl3hrs 



LoNtra 
-1 / 



-8 X 10 



-3 ( Dobs 

\24hrs 



+ 1.008 



( Dobs V 
V24hrs J 



Definition of threshold test statistic A: 



N 



21og(A) = ^ 



( Tj -{t Q +jT p )f 



at 



N 

£ 



(tj - (to + JT P + i cos^j + 4>))f 



(9.36) 



. {fToTT9l) 
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Appendix C 

Inclination functions 



A spherical harmonic Yi m can be written as a sum of spherical harmonics 
of the same degree, which have been rotated by the three Euler angles I, lv 
and 0. This process is described in the following equation: 

I 



Y lm {e^)= Y, 2 w( / > w > n ) 1 W( 7r A/) J 



(C.l) 



-1,2 



where the "D® , (I, uj, f2) are the Wigner-D functions used in quantum me- 
chanics. For a more complete review of the use of Wigner-D functions to 
describe inclined systems, please see Mardling (in preparation). 



C.l Derivation of inclination functions 



Following Mardling (in preparation), the Wigner-D functions can be written 

as 

V^ mf (I,co,n) = (-if+™+™' W (/)e i(mWmQ) , (C.2) 

where 



llmm'{I)= ^mm'( COS 



2n—m—m' 



I 



sin ■ 



2l-2n+m+m' 



(C.3) 



and 



«0) _ (_iyi 



I + m 
n 



I — m 



n — m — m 



, (C.4) 



' '(/ + m')\(l - m')\ 
(I + m)\(l - m)\ 

min [I + m, I + m'] . For con- 
venience, these Jimm'(I) have been tabulated in table C.l for the case of 
I = 2. 



where n m j n = max [0, m + m'] and n r 
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m m! Jl mm '{I) 



2 


2 


2 


I fl 4- cos I) 2 









i^sm 2 / 






-2 


±(l-cos/) 2 


2 


1 


2 


-2 sin I (1 + cos/) 









iy§sin(2/) 






-2 


h sin / (1 — cos/) 


2 





2 


±y§sin 2 / 









P 2 (cos/) 






-2 


^sin 2 / 



Table C.l: Table of inclination functions 7j mm '(Z). While 7^ mm '(/) is also 
defined for negative m, these have not been tabulated as 7/_ mm /(/) can 



easily be generated from 7; m _ m '(-0 using equation (C.5) 



C.2 Properties of inclination functions 



From equations ( |6.3[) an d (6.5 ), it can be seen that Y\- 
this and equation (C.2) it can be shown that 



(-l) m Y,* . Using 



,(J) = (-1)™+-' 7w (J). 



(C.5) 
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Appendix D 

Eccentricity functions 



Following Mardling d2008bb, the coefficients s l ™(e m ) and F^ m (e p ) are denned 



-plra I 



as 



s n ( e m) 



F l n m {e P ) 





/•27T _i 
/ ' m 


e im 




JO a ln 








U) 


27 


Jo rf 





- inM ™VdM m (t), 
e - imf "e inM ^dM p (t), 



such that 



mM m (t) 



,(1+1) 



n=— oo 
oo 



-imfp 



-inM p (t) 



(D.l) 
(D.2) 

(D.3) 
(D.4) 



where r m , a m , / m and M m (t) are the radius, semi-major axis, true anomaly 
and mean anomaly of the moon's orbit and r p , a p , f p and M p (t) are the 
radius, semi-major axis, true anomaly and mean anomaly of the planet's 
orbit 

For the applications investigated in this thesis only the I = 2 terms for 
4 m (e m ) and the I = 3 terms for F l ^ n (e p ) are used. Consequently equa- 
tions (D.l) and (D.2) can be written as 



F-n ( e p) 



1 

2tt 
1 

2^ 



®dM m (t), 



2tt 2 

m imfmg—inMn 

II a m 
2n „4 

Jt e - imf ^ nM ^dM p {t). 



(D.5) 
(D.6) 



291 



D.l Derivation of moon eccentricity functions 



In order for equation (D.5) to be evaluated, f m in the complex exponential 



must be written in terms of M m (t), the mean anomaly. While f m cannot be 
written as an explicit function of M m , both M m and f m can be written in 
terms of the eccentric anomaly of the moon's orbit, E m . Using the relation 
between the mean and eccentric anomalies (Murray & Dermott 1999, p. 
34), 

M = E — es'mE, (D.7) 



it can be seen that equation (|D.5|) can be rewritten as 

s n i e m) 



1 

2^ 



f 277 r jn e imf me -in{E m -e m sinE m ) ^ _ ^ cog £ m ) dEm ( Di 
JO a m 



Using the relation between eccentric anomaly and r m and f m (Murray & 



Dermott , 1999 p. 33) given by 



T m — CL 



"m \\ &m COS Ei 



cos E m - 

COS f m = 



1 — e m cos E, 



(D.9) 
(D.10) 



Til 



it can be seen that r m e im ^ m can be entirely specified by E m . While analytic 
solutions to equation (D.8), using equations (D.9) and (D.10) exist for n = 0, 
for the dependence of the Fourier coefficients s^ m (e m ) on 6 m must 



be determined numerically, 
table EH 



This dependence up to order e m is given in 



D.2 Properties of moon eccentricity functions 

There is some redundancy in the coefficients s l ™(e m ). This can be seen by 
taking the complex conjugate of equation (D.3), 

l 00 
!^n e -imf m = <y s l ™(e m )e- mMm{t) , (D.ll) 

" t n=— oo 

where * denotes complex conjugation, and comparing it to the equation 
where m and n have been replaced with their negative values, 



m e -im/ m = ^ s l -™(e m )e- inMm(t) . (D.12) 
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2 2 



2 



5 

4 

3 

2 

1 



-1 

3 

2 

1 





2 



,3 
24 m 



— 3e 

5„2 



_ 19 P 3 
8 in 
5„2 
2 m 

m 



JL P 3 
24 m 

4°m 



1 + 3-e 2 



+ ±e 3 
^ 8 m 



Table D.l: Table of the dependence of the Fourier coefficients s l v m {e rn ) on the 
moon's orbital eccentricity to order e^. While the coefficients are defined 
for negative m, they can be derived from coefficients with positive m using 



equation (D.14). 



Reversing the order of summation gives 
J 



m —imfn 
i 1 



£ s l -™(e m )e 



inM m (t) 



(D.13) 



By comparing coefficients of like exponentials in equations (D.ll ) and (D.13) 
we have that 



Am* i 



A—m 



(e m ). 



(D.14) 



D.3 Derivation of planet eccentricity functions 



As in the case of s„ (e m ), in order to evaluate equation (D.6), both M p (t) 



and f p need to be written in terms of the eccentric anomaly E p . Again this 
can be done by noting that 



cos/p 



a p (1 — e p cos E p ) , 
cos Er, — e„ 



1 



e p cos Ep 



(D.15) 
(D.16) 



Again, while analytic solutions exist to equation ( D.6| ) using equations (D.15 ) 
and (D.16) for n = 0, for n / the dependence of F v m (e p ) on e p must be nu- 
merically determined. This dependence up to order e p is given in table D.2 
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m n 



Jhnf 



m n 



F 



IJrnf 



163 ,3 
4 C P 
127 p 2 

8 3 

l-6e 2 

~~ e P "i i e p 

±e 2 
8 C P 





145 p 3 
6 e P 

±e 3 

3 S 
4 C P 

±e 3 

2 C P 

1-e 2 
4e* 



6 P 
53 p 2 
8 C P 



3 3 



3 2 



6 

5 

4 

3 

2 

1 



5 

-1 



1 

2 

3 

4 



5e„ 

43 2 

4 C P 



3 



4 

3 

2 

1 



-1 

-2 

3 

2 

1 





11^3 



2e p + 4 

I + 2^ 

e P + 2 e p 

II 2 
8 P 
23 P 3 

12 P 
23 § 

4 e p 
2 C P 

+ ^e 3 

^ 4 C P 

l + 3e 2 



2e T . 



Table D.2: Table of the dependence of the Fourier coefficients F^ n (e p ) 



on the 

While the coefficients are defined 
for negative m, they can be derived from coefficients with positive m using 



planet's orbital eccentricity to order e 3 . 



equation (D.20). 



D.4 Properties of planet eccentricity functions 

As with the case of the s l ^ n (e m ), the coefficients F l ^ n {e p ) are not independent. 
Taking the complex conjugate of equation (D.4) gives 

(J+i) oo 
a P e imf p= £ ^(e p ) e » M P W. (D.17) 

'p n=— oo 

Considering the expression where m and n have been replaced by their 
negative values 

Jl+l) -co 
P Jrnfp _ ST^ wl-m (p \JnM p (t) 



,0+1) 



i ? ^ m (e P )e inM " (t) , (D.18) 



and the order of summation has been reversed, gives 

D tmf p 



<-•'■- Yl ^in m (e P )e inMpW . (D.19) 



„(/+!) 
p n=— oo 
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It can be seen by comparing like coefficients in equations (D.17) and (D.19) 
that 

(D.20) 



F^*(e p )=FL- m (e p ). 



I— mi 
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Appendix E 

Analytic estimate for the 
duration of transit ingress 
and egress 



For this thesis, we would like to be able to compare detection thresholds for 
the four methods proposed in the literature for detecting moons of transiting 
planets. However, the thresholds corresponding to two of these methods, 
barycentric transit timing and transit duration variation, depend on the 
duration of the ingress and egress of transit, that is, the time it takes for a 
planet not overlapping with the disk of its host star to completely overlap 
with it, and vice versa. Consequently, in order to perform this comparison, 
we need an expression for the duration of transit ingress and egress for the 
case of circular orbits, the comparison case considered. In particular, to 
fit with the other expressions derived, we would like this expression to be 
formulated in terms of the planetary radius, the stellar radius, the transit 
duration and the impact parameter, and also be simple enough to allow 
insight into the physics. 

For the case of circular orbits, the transit light curve is symmetric, and 
thus the duration of ingress and egress are equal. Consequently, for this 
appendix, we will concentrate on deriving the duration of ingress only, but 
note that the expressions derived can be just as equally applied to the egress. 
First, we note that the duration of ingress can be written in terms of the 
distance travelled by the planet during ingress and the velocity at which the 
planet travels, that is, 



Vtr ' 
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(a) Exact geometry. 



(b) Approximated geometry. 



Figure E.l: Schematic of the position of a planet on the face of its star, at 
the beginning and end of transit ingress. The limb of the star is shown as a 
thick black line while the position of the planet at the beginning and end of 
ingress is shown by two circles. The path taken by the center of the planet 
is shown as a dashed line, and the center of the star is treated as the origin, 
and labeled O. 



where the distance DB is defined in figure E.l (a) and where Vf r is the 
velocity of the planet along the plane of the sky during transit. Noting that 
the duration of the transit is given by 



-tra 



2CA 

Vtr 

2jm-5i, 



Vtr 



we can write equation (E.l) as 



Ttra - 



DB 



(E.2) 
(E.3) 

(E.4) 



Thus, in order to determine Tj n , we must determine the distance DB. 



From figure E.l (a) we have that the distance DB is given by DA — BA. 



Using Pythagorus, we have that 



DA 



(R s + R P ) 2 



5 2 ■ 

mini 



(E.5) 
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and 

BA 

Consequently we have that 



(Rs - R P ) 2 - Si 



DB = J (R s + R p ) 2 - S„ 



and thus from equation (E.4) 



Rp) 2 



(Rs - Rp) 2 - s 2 



(Rs - Rp) 2 - S 2 



- tra ' 



2JRI-51 



(E.6) 



(E.7) 



(E.8) 



While this expression is exact, it is functionally complicated. As we are 
deriving this formula with the final aim of gaining insight into the effect that 
planet size, transit duration and impact parameter have on moon detection 
thresholds, it would be useful to use an approximation to this equation 
which describes its general behaviour, but is more physically insightful. In 
particular, as planets are in general much smaller than their host stars (R p = 
0.103-Rq for Jupiter and R p = 0.0092i?Q for Earth), we can use this to 



obtain such an approximation of equation (E.7) and consequently a more 
useful expression for Tj n . 



This approximation can be derived either by expanding equation (E.7) 
and only retaining terms of order R p /R s , or by using an approximation to 



figure E.l(a) As both these approaches give the same result, we will derive 



an approximation to equation (E.7) by approximating figure E.l(a) as this 



approach involves less algebra and is easier to follow. We begin by replacing 
the curved limb of the star locally with a straight line (see figure E.l(b) ). For 



ease of comparison, the points which are equivalent to those in figure E.l(a) 



have been marked with a dashed version of the same letter. Consequently, 
by determining D'B' , we obtain an estimate, for DB and consequently, can 
calculate Tj n . 

We begin by considering the angle IC'B'E' . From geometry, we have 
that LO'C A' = a, and consequently, that IC'B'E' = a as the line O'C 
and B'E' are parallel. From trigonometry we have that the distance C'B' 
is given by 



C'B' 



Rn 



sin a 



(E.9) 



Using the same arguments to determine D'C' , we have that D'C' is also 
R p / sin q and consequently that 



D'B' 



2R n 



sin a 



(E.10) 
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or 



sin a 



2Rp 
D'B' 



From AO'A'C we have that 



(E.ll) 



cos a 



Squaring both expressions, and adding them gives 



1 



2R r . 



D'B' 

which, upon simplification yields 

D'B' = 2R n \l 



+ 



R* 



-R.s 



-1/2 



(E.12) 



(E.13) 



(E.14) 



Recall that equation (E.14) is the same expression as the one that would 



have been obtained by expanding equation (E.7) and retaining terms up to 
order R p /R s . In this context, consider equation (E.4), restated below 



tra 



R 2 -5 2 



x DB. 



(E.15) 



As our approximation for DB is only correct to first order in R p /R s , and the 
term in square brackets is zeroth order in R p /R s , Ti n can only be correct 
to first order in R p /R s . Completing this expansion and substituting in 



equation (E.14) for DB, we have that the duration of ingress is approximated 
by 



l 



-in ~ ± tra 



R 2 -5 2 ■ 

1 L s u mtn 



~-2R p 1 



R.i 



'1/2 



2 |t,„ i 



R, 



(E.16) 
(E.17) 



This formula is much more appropriate for the purposes of this thesis 



than equation (E.8). To see this, consider how the behaviour described by 



equation (E.17) compares to the relationships that one would naively expect 
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between the ingress duration and the planetary radius, the transit duration 
and the impact parameter. For the case of planetary radius, we would 
expect that larger planets would have larger ingress durations as a result of 
the longer distance that they have to travel to pass over the limb of the star, 
and from equation (E.17) we see that this is the Ti n is proportional 

to R p . In addition, the fact that Ti n is proportional to T tra is also expected 
as the shape of the transit is affected by the geometry of the transit and the 
scale of the transit is affected by the transit duration. So, any of the factors 
which affect the transit duration, either by altering the velocity of the planet 
during transit (e.g. the eccentricity or semi-major axis of the planet's orbit) 
or by altering the distance that the planet has to travel during transit (e.g. 
the impact parameter of the transit) can affect Tj n by way of this term. 
Finally we would also expect that the ingress duration would increase as 
5 m i n increases as the slope of the segment of stellar limb that the planet 
must traverse becomes increasingly shallow. Noting that to zeroth order 
in R p /R s , T tra oc (l — (5 m i n / Rs) 2 ) 1 (see equation ( E.3| )), we find that 
Ti n oc (l — (Smin/Rs) 2 ) ^ 2 • Thus equation (E.17) does display the expected 



increase in Ti n with increasing 5 m i n - Consequently equation (IE. 171) is a 
suitable equation to derive the detection thresholds shown in chapter ij 



1 Note that this formula fails for 5 m %n ~ Rs as a result of approximating the circular 
stellar limb with a straight line. In particular for the case of 5 m in = Rs, the stellar limb 
would be approximated by a horizontal line and the infinite transit duration implied by 
equation (E.17I physically corresponds to a planet traveling along this horizontal line. 
The discrepancy between the true ingress duration described by equation |E.8l and the 
approximate ingress duration described by equation (E.17 1 is only a problem for planets 
with very grazing transits. 
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Appendix F 



Effect of incorrect value of 
unobscured flux on r 



Light curve quantities such as r depend on the value assumed for the unob- 
scured flux of the host star. As a result of factors such as inherent photo- 
metric variability and small number statistics, the value of this flux cannot 
be exactly observationally determined. In this appendix the effect of this 
error on r will be investigated, with the aim of showing that this source of 
error can be effectively neglected. 

To begin, consider the light curve shown in figure [FTTj In particular note 
that the observationally selected unobscured flux (dashed) is not correct, and 
in this case, is slightly above the actual unobscured flux. In order to analyse 



the effect of this error on r, the definition of a given by equation ( 7.48 ) must 



be updated. With this in mind we introduce a photon deficit ctf, associated 
with selecting the wrong unobscured flux, such that a is now 



a 



(F.l) 



where we recall that a p , a m and a n are the photon deficits associated with 
the transit of the planet, the transit of the moon and the photometric noise 



respectively. This breakdown is shown in figure F.2(a) 



To investigate the effect of this additional term we will use an approach 



4 



equivalent to that used in section 7.4.5 to write Ar in terms of A 
and r m . To begin, we group a p and aj into a single term (see figure F.2(b) ), 
such that equation (F.l) becomes 



a 



p + a/) + a m + a r> 



(F.2) 
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^ Time 

Figure F.l: A cartoon of a possible light curve which could be produced 
by a planet-moon pair, showing both the theoretically predicted (black line) 
and observationally measured (black dots) luminosity. In addition, the ob- 
servationally estimated unobscured flux is also shown (dashed line). 



Using the same method as used in section 7.4.5 it can be shown that 



^* K + a/ ) V.fap + a f ) + 4^ a m 

J2i( a P + a f) + a m 

S» to p V a + Si ta f V at + Si tom V a 



E, to - 



At + t + jT p 



Ei "p + a/ + 



a. 



(F.3) 
(F.4) 



Noting that as the a p and are symmetric and centered on the transit 

window, J2i(t a p) I 'Yli a p an d Ei(^ a /)/ Yli a f are b°th equal and given by 
t p . Consequently 



Ar + t + jT p 



i~p{Ap + AAp) + T m A r 
+ AA V + 



(F.5) 



where the definition of r p , r m , A p and ^4 m given in section 7.4.5 have been 
used, and where we define AA p = ]T\ a f- 

Consequently, the effect of a small error in the baseline is equivalent to 



a small error in A p , AA p . Now, recall from the discussion in section 8.4.1.1 
that At is approximately proportional to A m /(A p + A m ). Consequently, 
using the binomial expansion we expect the error in At caused by select- 
ing the wrong baseline to be a factor of AA P /A p smaller than At. As a 
statistically significant difference between the base of the transit and the 
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(a) Planet transit and flux error separate, (b) Planet transit and flux error combined. 



Figure F.2: Figure showing two different ways of dividing the composition 
of a transit light curve. The constituent components (moon, planet and 
error due to incorrect assumed value of the unoccluded flux) which go into a 
transit light curve are shown above, and the full light curve is shown below. 
In particular, any region in which there is a photon deficit is shaded grey. 
In addition, the beginning and end of the observation window are shown 
by vertical dashed lines, while the line of symmetry of each component is 
shown by a vertical dotted line. 

unobscured flux must be detected to confirm a planet, we would expect the 
error in the unobscured flux to be much smaller than the dip depth, and 
consequently, the error in A p to be much smaller than A p , and thus, the error 
in At to be much smaller than At. Thus, the small changes in A p caused 
by an incorrect baseline should not strongly affect the measured value of 
At. Consequently, this error can be ignored. 
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Appendix G 

Proof that the shape of a 
transit light curve does not 
depend on Q v 



In chapter [8j the coordinate system is rotated by £l p for convenience. How- 
ever, in order to justify this approach it must be shown that the shape and 
mid-time of the transit light curve are unchanged by such a transformation. 
This will be done by considering the dependance of L(t), the measured lu- 
minosity of the star on S p (t), the projected distance between the planet and 
the center of the star, and then showing that 8 p (t) is not dependent on Q p . 



To begin, consider equation (7.41), the expression linking the measured 
luminosity and S p (t) 

L(t) = L -a p (5 p (t)). 

As Lq is a constant related to the star (and thus doesn't depend on Q p ) any 
effect that Q p has on the light curve will be through the a p term. Recalling 
from equation (7.42) that a p depends only on the geometry, that is, R s , R p 



and 5 p (t), so any dependance on fi p must be through the 5 p (t) term. 



Recalling that for Part III of this thesis, the xy plane is defined to lie in 
the plane of the sky, we have that 



8 p (t) 2 = x p {tf + y p (tf 



(G.l) 



Substituting in equations (|7.7|) and (7.8) for x p {t) and y p (t) and simplifying 



gives 



Sp{t? = r p (t) 2 cos 2 (f p (t) + oj p ) + r p (t) 2 cos 2 I p sin 2 (f p (t) + u p ), (G.2) 
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V 



Figure G.l: A cartoon showing the path taken by the planet across the face 
of the star (dashed line) for three orbits, which vary only in their J) p values. 
Note that while the path changes, the intensity along it does not. 



where the time dependance has been explicitly written. Note that while the 
expressions for x p (t) and y p (t) depended on Q p , this expression is indepen- 
dent of Q p . As both Lq and 5 p (t) and thus a p (5 p (t)) are independent of O p , 
L(t) must also be independent of Q p . 

This is exactly what we would physically expect, as £l p merely rotates the 
chord taken by the planet across the face of the star about the center of the 
star (see figure G.l). As the measured intensity depends on the projected 
distance from the center of the star only, this should not modify the intensity 
along the chord and thus not modify the transit light curve. 
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Appendix H 

Estimate of probability that 
a planet on an eccentric orbit 
has a higher transit velocity 
than a planet on an 
equivalent circular orbit 



For the case of a planet on an eccentric orbit, the velocity at which the 
planet passes across the star may no longer be given by a p n p , the value for 
the case of a circular orbit. This difference is of particular importance as, in 
chapter [8] an expansion is used to derive expressions for At which assumes 
that v m /vtr < 0.66. As vt r defines where this expression breaks down, 
discovering the effect of orbital eccentricity and orientation on Vt r , and in 
particular the set of orbits for which vt r > a p n p , is of interest. Consequently, 
the set of orbits (parameterised by ft r , the value of / during transit) for 
which this occurs, and in particular the values of ftr for which the transit 
velocity is equal to a p n p , will be investigated, and used to calculate the 
probability that a given transiting planet will have vt r > a p n p . 

In order to determine which orbits have vt r > a p n p , we need an equation 



describing the transit velocity. While the expression given in section (7.2) 
is exact, it is not very analytically tractable. Consequently we use a simpli- 
fied version derived from angular momentum constraints. From [Murray fe 



Dermott ( 1999 ) we have that 



r p x Vp — r p v perp — ipflpWl — e^, (H.l) 
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where r p and v p are position and velocity vectors of the planet, and where 



u perp 



is the component of the planet's velocity perpendicular to the position 
vector r p . As equation (H.l ) is true for all points on the orbit, equation (H.l ) 
can be evaluated for the case where the planet is transiting. In this case r p = 
r p(ftr) and Vp er p = vtr- Substituting these identities into equation (H.l), 
rearranging, and expanding r p (f tr ) using equation (8.9) gives 



vtr = (n p a p ) 



l + e p cos(/, 



tr 



(H.2) 



This is the expression for transit velocity which will be used in this appendix. 

In order to determine whether it is more or less likely for planets on 
eccentric orbits to transit while their transit velocity is higher than that 
of a planet on a circular orbit with the same semi-major axis, we need to 
determine the values of ft r at which this transition occurs. Setting the 
transit velocity of an eccentric orbit equal to that of a circular orbit with 
the same semi-major axis gives 



1 + e p cos(/, 



tr 



1 



(H.3) 



which gives, after rearrangement 



f tr = ± COS 1 



±fp lira- 



l-ej-l 



(H.4) 
(H.5) 



As y 1 — e p ~ 1 < 0, the argument of the inverse cosine is always nega- 
tive. Consequently, the range of angles for which the transit velocity in the 
eccentric case is always described by f P) u m > vr/2. Also, as e p tends to 1, 
equation (H.4) simplifies to give f p = ±/ Pj Hm = ^ z7r - 

As the probability, for a planet to transit is inversely proportional to the 
distance between the planet and star at the time of transit (see section 7.2), 
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Figure H.l: Plot of the probability that a planet on an orbit will transit its 
host star with a velocity higher than that for a planet on a circular orbit 
with the same semi-major axis. The shaded regions indicated the probability 
that a planet with that eccentricity will be observed with transit velocity 
within 20 percent of that for the equivalent circular orbit. 



the probability that a planet on an eccentric orbit transits is described by 



r i 

Ptot oc / df p , 
J-* r P (fp) 

™ 1 + e p cos(/ p ) 

Op(l - ej) 

f p + e p sin(/ p ) 

Op(l - ej) 

2vr 

a p (l-ej)' 



dfp, 



(H.6) 
(H.7) 
(H.8) 
(H.9) 



In addition, the probability that a planet will transit only when its transit 
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velocity is larger than that of the equivalent circular orbit is described by 

;df p , 



P> oc 



-fp,Um r p(fp) 

f P ,u™ 1 + e p cos(f p ] 



fp,lir, 



a p {\ 



f p + e p sin(/ p ) 
a p (l - ej) 



pi 



fp ,lin 



fp,lin 



Mm 

+ 2e p sin(/ Pi 



lira ) 



a p (l 



(H.IO) 
(H.ll) 
(H.12) 
(H.13) 



As the proportionality constant for both equations is the same, we have 
that the probability that a planet on an eccentric orbit will transit while its 
orbital velocity is larger than that for the equivalent circular orbit, given 
that it transits, is equal to 



Ptot 



fp,lim &p sin(/p 



Urn ) 



TT 



(H.14) 



As e p sin(/ Pi i im ) > f or ir/2 < f p j im < ir, the range for f p j im identified 
above, equation (H.14) can be written as 

P~> fp,lim 



P 



> 



tot 



IT 



Substituting in the lowest possible value for f p jim, tt/2, we obtain 



Ptot 2 



(H.15) 



(H.16) 



Consequently, planets on eccentric orbits have a higher probability of tran- 
siting near their periastron passages while their orbital velocity is higher 
than that of the equivalent circular orbit, than near apastron when their 
velocity is lower. In addition, this probability increases nearly linearly as 
the eccentricity increases (see figure H.l). 
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Appendix I 

Form of TTV^ perturbation 
for the case of slightly 
eccentric moon orbits 



Eccentricity in the moon's orbit can affect the form of At through altering 
the time dependence of the planet and moon's position on the face of the 
star. To investigate this effect, the case where the moon's orbit was slightly 
eccentric (e m <C 1), but aligned to the line-of-sight (J m = and Q, p = Cl m ), 
and the planet's orbit was circular (r p = a p ) and also aligned to the line- 
of-sight (Ip = 0), was examined (see figure 1.1). Again expanding the first 



term of equations (8.15) and (8.16) about t = jT p + to, we have that 



x p = v tr (t - (jT p + t )) - r m (t)— ^- cos(/ m (t) + U} m ), 

X m = V tr (t ~ {jT p + t )) + r m (t)— COs(/ m (t) + W m ), 



(LI) 
(1.2) 



pm 



where r m (t) is described by equation (8.10) and f m {t) is described by equa- 
tions (8.12) and (8.14). To allow easy comparison with the equations de- 
rived for the case of circular moon orbits, it would be useful if the term 
r m (t) cos(f m (t) + Um) could be written in terms of n m and t. 

As we are investigating the signal Ar for the case of small e m , we can 
approximate the r m (t) cos(/ m (t) + oj m ) term using an expansion valid for 
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Moon Orbit 




Figure 1.1: Schematic diagram of the same form as figure |8.4| of the co- 
ordinate system for the case of eccentric moon orbits. In particular, it is 
assumed that I p = n/2, I m = tt/2 and Q, m = Q p . 



small e m . From Murray h Dermott (1999, p. 39-40) have that 

r m = a m - e m a m cos(M m (t)) + 0(e 2 m ), (1.3) 
cos(/ m (i)) = cos(M m (i)) + e m (cos(2M m (t)) - 1) + 0(e 2 m ), (1.4) 
sin(/ m (t)) = sin(M m (t)) + e m sin(2M m (i)) + 0(e 2 m ), (1.5) 

where M m (i), the mean anomaly of the moon's orbit is equal to n m t+M m (0), 
and where we note that the explicitly stated time dependance differentiates 
between M m (t), the mean anomaly of the moon and M m , the mass of the 
moon. Expanding the cosine function and using these expressions gives 



r m (t) cos(/ m (i) + u m ) = a m cos(M m (t) + u m ) 

1 3 

+ e m a m - cos(2M m (t) + u m ) - e m a m - cosu m + 0(e 2 

Thus, to first order in 6777,, equations and. ( |I.2[ ) become 

Mr, 



V tr (t - (jT p + t )) 



"M 



cos(M m (t) + oj Ti 



pin 



1 M m , n% , , , . 3 M m 

e m a m -TJ — cos{2Mm{t) + ui m ) + e m a m -^ — cosw m , 



'2 M, 



pin 



'2 M, 



pm 



(1.6) 



(1.7) 
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X m = V tr (t ~ (jT p + t )) + a mTF^- COs(M m (t) + 0J m ) 



1 Mm 



2 M 



cos(2M m (t) + 



3 M„ 



pm 



2M, 



cosw m . 



pm 



Rearranging, writing M m (t) out in full, and setting x p and x m to the 
location of the star's limb gives 



R s = v tr (tin,p - (jT p + t )) - a 



+ M m (0)+uj m ) 

ivi r) 



± pm 



M„ 



"M, 



pin 



1 3 

- cos(2(n m t iriiP + M m (0)) + u m ) - - cosu r , 



, (1.9) 



(jT p + t )) + a ™^^" cos(n m t + M m (0) + CJ m ) 



+ 6m Or, 



M, 



pm 



1 3 

— cos(2(n. m tj nim 



(1.10) 



M„ 



-Rs = v tr {teg, P - (jT p + t )) - a m — — cos(n m t egtP + M m (0) + £J m ) 



p/7i 



Cm.Q j r 



' M, 



pm 



1 3 

- cos(2(n m t e9iP + M m (0)) + w m ) - - cosw,-. 



(1.11) 



R s = vtr{t eg ,m - {jTp + t )) + a m ^ — cos(n m t e9im + M m (0) + 



pin 



M„ 



pm 



1 3 

- cos(2(n m t eSjm + M m (0)) + oj m ) - - cosw m 



(1.12) 



Comparing equations (1.9) to (1.12), with equations (8.43) to (8.46), the 



equivalent equations for the case of circular and coplanar orbits, it can be 
seen that they are very similar. In particular, noting that for circular orbits 
fm(t) = M m (t) (and thus / m (0) = M m (0)), the first line of each of equa- 



tions il.9h to (1.12) are exactly the same as the corresponding equations for 



circular coplanar orbits, while the second line is proportional to the eccen- 
tricity of the moon's orbit (and would consequently vanish for the circular 
case) . 
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Again substituting 9 iri}P = n m t in%p + M m (0) +tt /2+u> m , 9 eg:P = n m t eg , P + 
M m (0) + tt/2 + 

n m teg,m + M m (0) + 7r/2 + w m , we obtain, 

- + n m (jT p + t ) + M m (0) + ^ + w m = 6» irii p - ^ sin 9 inp 

V tr 2 Wtr 

+ ( 2 cos ( 2 ^n,p - w m ) + - cosa; m J , (1.13) 



n m i?. 



7T 



+ M m (0) + - + oj m + n m (jT p + t ) 

V tr 2 



Vtr 



'in,m ' Sin "in,m 

Vtr 



UJ r , 



+ - COS W r , 



(1.14) 



""' ^ + M m (0) + ~ + ui m + n m (jT p + t ) = e9)P 



sinl 



Ufa 



+ e. 



Vtr 



cos(20. 



+ - COS 0J n 



(1.15) 



+ M m (0) + ^ + w m + n m (jT p + t ) = 9 eg ,m + — sin ( 

Vtr * Vtr 



eg,m 



Vtr 



1 3 

- cos(26» e9im - u! m ) + - coso; r , 



(1.16) 



Each of these four equations is mathematically equivalent to 



$ = 9 ec + B 



sin ( 



c,„ I -cos(26> e 



w m) + - COSCJ r , 



(1.17) 



where $ and -B are equivalent to their definitions in section 8.4 (noting that 



/m(0) = M m (0) for circular orbits) and are explicitly given in table 1.1 



While this expression is not analytically soluble, it can be explored using 
a perturbation expansion for the case where e m is small. As 9 ec is likely to 
be similar to the value of 9 CC calculated for the case of circular coplanar 
orbits, especially for the case of small e m , we can write 



9 CC + A9 

ec ■ 



(1.18) 
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X 


<f>x 




in,p 
in, m 
eg,P 
eg, m 


Af m (0) + uj m + § + n m (jT p + t ) - ^ 
M m (0) + u m + f + n m (jT p + t ) - ^ 
M m (0) + w m + f + n m <JT p + t ) + ^ 
M m (0) + w m + § + n m (jT p + t ) + ^ 


Vtr 
% 

fir 
_ Up 

Vtr 
Vjn 
Vtr 



Table LI: The values of $ and B corresponding to equations (1.13) to (1.16). 



Substituting this into equation (1.17), we have that 



= A9 ec - Bsine cc + B 



sm{9 cc + A9 ecj 



r„, | - cos(2(6» cc + A9 e 



0J m ) + g COSUJ r, 



(1.19) 



where equation (8.53) has been used to cancel terms. 

A9 ec = e m g(9 cc ) + ... 



Expanding A9 ec using 



(1.20) 



and substituting into equation (1.19) gives 



= e m g(9 cc ) - B sin 9 CC + B 



sm(9 cc + e m g(9 cc )) 



1 3 

- e m ( - cos(2(6> cc + e m g(9 cc )) - u m ) + - cosw r , 



• (1-21) 



Expanding the sinusoids and gathering first order terms in e m gives, after 
rearrangement 



g( 



1 l?(cos(26> cc — u) m ) + 3 cos u m )) 



1 + B cos 9 C 



(1.22) 



Consequently, we have that 



9 ec = ^ + Y / ^J k (kB)sm(k^) 

k=l 



e rn 3cosu m + cos(2$ + *£T=i Pk{kB) sin(A;$) 



+ — B 
2 



U) r , 



1 + 5 cos(,4 + ZT=i i-h{kB) sin(fc$)) 



, (1.23) 
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As this expression is structurally complex, while an expression for At 
could be derived using this equation, it would provide little physical insight 
into the system. Consequently, expressions for At will only be derived for 
the special case, v m /vt r <C 1, considered in the next section. 



1.1 Form of At 



To investigate the form of At for the case of non-zero values of e m , it was 
decided to consider only the simplest case, that is, where v m /vt r <C 1. While 
more general expressions for At can be derived (e.g. by retaining second 



order terms in v m /vt r and v p /vt r , as in section 8.4), they are much more 



complicated and this case will not be considered in this thesis. 



1. 1.1 Case where v m /v tr <C 1 



For the case where v m /vt r <C 1 and v p /vt r <C 1, equation (1.28) simplifies to 



ec = $ + Bsm(^>) + -yS(3cosw m + cos(2$ - oj m )). (1.24) 

Using the definitions of 9i n>p , 0i njm , Q e g,p and 6 eg ,m and the expressions 
for $ and B in table 1.1 expressions for tinj» Un,mi t eg , P and t eg ^ m can be 
derived. Retaining only first order terms in v m /vt r and v p /vt r gives 



Un, P = jTp + t - — - — — cos (M m (0) + uj m + n m (jT p + t ) 

Vtr n m V tr V 



Vtr 



6m 1 Vp 
2 Ti m Vtr 



3 COS L0 r 



n R 

cos I 2M m (0) + 2n m (jT p + t ) - 2^^ + u v 

Vtr 



(1.25) 



tin,m = jTp+t - — + — — COS ( M m (0) + L0 m + n m (jT p + t ) 
Vtr n m V tr V 



Vtr 



+ 



2 Tim Vtr 



3 COS OJn 



n R 

cos ( 2M m (0) + 2n m (jT p + t ) - 2^^ + u. 

Vtr 



(1.26) 
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P cos ( M m (0) + uj m + n m (jT p + t ) + 



Cm 1 Vp 
2 71/71 



3 COS UJ m 



. Tlm.Rft 



cos ( 2M m (0) + 2n m (jT p + to) + 2^^) + u Tl 

Vtr 



(1.27) 



t eg ,m = jT p +to+— +— — cos ( M m (0) + o; m + n m (jT p + t ) + nTO-Ra ) 

, C m 1 f m / 

+ -7; 3coso; m 

2 n m v tr y 

- cos ^2M m (0) + 2n m (jT p + t ) + 2^-^) + V (1.28) 

These expressions can now be used to construct expressions for r p , r m , 
A p and A m . As e m is small, the eccentricity does not strongly affect the 
moon's velocity. Consequently, the assumption of uniform velocities made 
in section 8.3.2 is still valid, and consequently equations (8.19), (8.20), (8.30) 
and (8.31), the equations defining r p , r m , A p and A m can be used. Substi- 



tuting these expressions into equations (8.19), (8.20), (8.30) and (8.31) and 



again retaining only first order terms yields 
Tp = jT p + t 



— —cos ( nmRs I cos (M m (0) + u m + n m (jT p + t )) 

n m V tr V V tr 



e m 1 Vp 

2 Tl m Vtr 



3 COS LJm 



cos 2 



Vt7 



cos(2M m (0) + 2n m (jT p + t ) + oj m ) , (1.29) 



Tm = jTp + t + 



"m Vtr 



cos 



+ 



Vtr 
1 

2 Tl m Vtr 



cos (M m (0) + oo m + n m (jT p + t )) 
3cosw m 



cos 2 



. Tim. R s 



cos(2M m (0) + 2n m (jT p + t ) + w m ) , (1.30) 
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Ap — Ap + A^ 



1 V tr V F 



sin 



^vn.Rf- 



n m Rs v tr V v t 

&m Ap Vt r Vp . / n m R s 

— sin 2 

2 n m R s v tr V Vtr 



sin (M m (0) + w m + n m (jT p + t )) 



x sin(2M m (0) + 2n m (jT p + t ) + u m ), (1.31) 



1 "U^y Ifyft . / Tt m R^ 

n m R s V tr \ V t 

, &m A m Vtr Vi, b . , 

H sin 2 

2 n m # s v tr V y t 



sin (M m (0) + uj m + n m (jT p + t )) 

^mR-s 



x sin(2M m (0) + 2n m (jT p + t ) + Wm)- (1-32) 

Combining equations (1.29) to ( |L32 ) using equation (7.59) and neglecting 
any terms of order v^/v^., v m v p /v'f r and Vp/vf r or greater gives 



■4pm Mpm 



V^vn.Rt- 



A m Mp A„M m a m 
At = s cos 

Vtr \ V t 



x cos (M m (0) + Lo m + n m (jT p + t )) 

, &m, A m Mp ApM m CL m I 
+ — * I O COS UJ n 



cos 2 



2 

IT-mRs 
Vtr 



ApmMp m 



Vtr 



cos(2M m (0) + 2n m {jT p + t ) + w m ) . (1.33) 



This expression can also be approximated by comparing the relative sizes 
of A p M m and A m M p to give 



At 



Mr, 



A 



M p + M m A p + A m v tr 



cos 



n"mR& 
Vtr 



x cos (M m (0) + uj m + n m (jT p + t )) 



+ 



.4 



m 



2 M p + M m A p + A m v tr 



3 COS bJr, 



- cos (2^^ J cos(2M m (0) + 2n m (jT p + t ) + . (1.34) 

Now that expressions for At have been calculated for the case of eccen- 
tric moon orbits, the effect of this eccentricity on the form of At will be 
discussed. 
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1.2 Effect of eccentricity in the moon's orbit on At 



As can be seen from equations (1.33) and (1.33), eccentricity in the orbit of 
the moon affects At in one very important way. It leads to the distortion 
of the shape of At as a function of transit number, so that it is no longer a 
sinusoid, by introducing higher order harmonics. In particular, for low values 
of e m and v m /vt r , the amplitude of these harmonics is proportional to the 
eccentricity of the moon's orbit, while the phase depends on the orientation 
of the moon's orbit. 

To investigate this effect, a simulation was run for the case of low (0.1) 
v m /vtr, comparing At values calculated from equation (8.70), assuming a 



circular moon orbit, calculated from equation (1.33) assuming an eccentric 
moon orbit, and calculated directly from the simulated light curve. These 
simulations were conducted for the case of "low" eccentricity (e m = 0.1) and 
"moderate" eccentricity (e m = 0.4) and a range of different orbital orienta- 
tions {oj = it/2, it and 3n/2) and are shown in figure 1.2 Recalling from 



table \L2\ that the regular satellites in the Solar System all have eccentricities 
less than 0.06, this simulation indicates that, equation (1.33) accurately de- 
scribes At for a range of realistic, non- negligible eccentricities and a typical 
range of orientations. 
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(b) e m = 0.4, UJ m = 7r/2. 
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(f) e m = 0.4, w m = 7r/2. 



Figure 1.2: Comparison of the value of At calculated directly from simulated 
transit light curves (black), with that of equations ( |8.70 ) and (1.33), the an- 
alytic approximations to At assuming circular (blue) and eccentric (red) 
moon orbits. The orientations selected (uj m = vr/2, ir and 37r/2) physically 
represent orbits with pericenter directions pointing toward the observer, 
along the plane of the sky, and away from the observer, respectively. Fi- 
nally, these plots were constructed for the case of a large gas giant moon, in 
particular, it was assumed that R p = 0.1R S , R m = 0.01i? s and a m = R s . 
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Appendix J 

The effect of second order 
terms on ej for the case of 
white noise 



In chapter [7j expressions for At and ej were derived by performing a bi- 
nomial expansion on equation ( |7.51| ), the equation defining r, and retain- 
ing first order terms in ^a„/ 'Ei( a p + a m)- However, for the case where 
Ej &n is large, due to, for example, a dim host star or a bad pixel, or where 
Ei( a p + a m) is small due to e.g. a small planet or a short transit duration, 
Ej ct n <C Y^i( a P + a m) may no longer hold. 

To explore the effect of neglecting higher order terms in Ej a n j Ei( a p + 



a m ) on ej, the next term in the binomial expansion of equation (7.51) will 
be retained and investigated for the case of white noise. Expanding equa- 
tion ( 7.51J and retaining all terms to second order in a n gives 



t _ Y,i t i( a p( t i) + QLm{tj)) Ej ~ (*o + jT p + At)) a n (ti 



Ei «p(*i) + oc m (ti) A p + A ri 



+ J2 t J2 k (-*» + *o + 3 T p + Ar ) Un(ti)a n (tk) ,j 1 , 
(A p + A m ) 



Consequently, to second order, 

_ Ei (U - {t + jTp + At)) a n (ti) 
€j A 4- A 



+ Ei Efc (-U + t o + 3 t p + Ar ) an(ti)a n (t k ) ^ 
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In chapter [9] it was shown that the the first term in equation (J.2) is 



normally distributed. Consequently, any non-normal behaviour exhibited by 
€j must be due to the effect of the second term. As this term is comprised of 
weighted sums of pairs of a n multiplied together, the simple normal formulas 
can no longer be used to analyse this term. In particular, the author is 
unaware of any method which will provide an analytic description of the 



distribution of e,- using equation (J.2). However, while the shape of the 



distribution cannot be calculated, the mean of the distribution can. 

This can be achieved by noting that the mean of a weighted sum of 
random variables is equal to the sum of the means of the variables multiplied 
by their associated weights, that is, if 



A' 



Y = Y J ^X l , (J.3) 



i=l 

where X\ to ar e random variables, then 

N 

HY = ^PiHi, (J.4) 
i=l 

where \iy is the mean of Y and m is the mean of Xi for i = 1 . . . N. Conse- 



quently, to calculate \i € using equation (J.2), the means of all the individual 
terms need to be calculated and summed. 



There are three types of terms in equation (J.2), terms which are pro- 
portional to a n (U), terms which are proportional to a n (ti)a n (tk) with i 7^ k, 
and terms which are proportional to (a n (ti)) 2 . The contribution of each of 
these types of terms to the mean will be investigated in turn. From the defi- 
nition of dn^ti), 1 we have that the mean of all terms proportional to a n (ti) is 
zero. In addition, as a n (ti) is symmetric and centered on 0, the distribution 
of a n {ti)a n {tk) for i ^ k is also symmetric and centered on 0. Consequently 
these terms also do not contribute to fj, e . However, the terms proportional 
to (a n (ti)) 2 are always greater than or equal to zero and consequently have 
non-zero mean. Neglecting the first order terms and the terms with i 7^ k 
gives 

= Ei (-U + t + jT p + At) a n (U) 2 
(A p + A m ) 



where a n (ti) 2 represents the mean of a n (ti) 



1 Recall that for the case of white noise a n (ti) is normally distributed with mean zero. 
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To determine a n (ti) 2 we begin by considering a random variable X which 
is distributed according to a normal distribution with mean zero and stan- 
dard deviation 1. From the definition of the chi squared distribution we 
then have that X 2 is distributed according to a chi squared distribution 
with one degree of freedom. Consequently it follows that a n (ti) 2 ja\ is also 
distributed according to a chi squared distribution with one degree of free- 
dom. 

As the mean of a chi squared distribution is equal to the number of 
degrees of freedom, we have that a n (ti) 2 = c£- Substituting this into equa- 

J2i(-ti + t +jT p + AT)a 2 L 



tion (J.5) gives 



Me 



(Ap + A n 



(J.6) 



Substituting equation (9.25) for ^^ « gives: 



Eilt (At - At p 



(Ap + A m ) 
(N obs + l)(AT-At p ) 



N obs (N obs +l) _ N obs (N obs +l) 



(Ap + A r 
(N obs + 1) (At - At p ) a 2 L 

(Ap + Am) 



(J.7) 



(J.8) 



(J.9) 



As Ar— Atp is the time difference between the photocenter of the dip and 
the center of the planetary transit, it can be seen that this error only occurs 
when the transit is asymmetric, for example, due to an orbiting moon. 

To demonstrate the effect of neglecting the higher order a n terms, equa- 



tion (|J.2|) was numerically investigated. The mean and the standard devia- 

,,„,, ranging 



tion of ej was calculated for a selection of values of <jl/(A p + A Ti 
from to 4.5. These numerical results were then compared with equa- 



tions (9.28) and (J.9) the first order formula for a e and the second order 



formula for [i e (see figure J.l). 



For reference, the point at which the numerical results start to diverge 
from the predicted curves corresponds to an O.8i?0 planet orbiting a Sun- like 
star, observed with a relative photometric error of 2.2 x 1CT 4 . As sub-Earth 
mass planets capable of hosting stable moons are unlikely to be detected by 



COROT and Kepler, the higher order terms in equation (J.2) can be safely 
neglected. 
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(a) Ar = 0s, N obs = 280. 
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(b) Ar = 1000s, N obs = 280. 
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(c) At = 0s, N obs = 140. 
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(d) Ar = 1000s, N oba = 140. 



Figure J.l: Comparison between equations (9.28) and (J.9), the theoretical 



predictions for the behaviour of <j e (black) and /i e (red) as a function of 
the ratio between the <tl, the photometric noise, and transit depth, and the 
values obtained from a Monte Carlo simulation. Exposure times of three 
minutes were used for all plots. N b s , the number of exposures was selected 
so that the two cases explored corresponded to a central transit at 1AU 
and 0.2AU respectively. 10 6 simulated transits were used to construct each 
mean data point while 5 x 10 5 simulated transits were used to construct 
the standard deviation data points. As ^ a n became more comparable to 
^2(a p + a m ), the probability of J2( a p + a m + ot n ) ~ 0, and consequently 
small denominators increased. Thus, in this region there was no appreciable 
tightening of error bars as the number of simulations increased. Note that 
error bars are only shown for points where the bars are larger than the 
marker. 
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Appendix K 

Derivation of equivalent 
white noise amplitude 



In order to compare the calculated values for a e for the case raw and filtered 
solar photometric noise with those for white noise, a method is needed to 
select the white noise which is the same as the solar photometric noise. 
For this work it was decided to compare white and solar photometric noise 
with the same power. To ensure that photometric variation on timescales 
relevant to transiting systems was included but variation due to longer term 
phenomenon such as the solar cycle were not, a three month span of data 
was used to determine the amount of power per unit time in the raw and 
filtered solar data sets. Using the definition of power 

N 

P n = ^2\a n (ti)\ 2 , (K.l) 

i=\ 

this was found to be 6491 W/m 2 and 5644 W/m 2 respectively. Consequently, 
a three month segment of white noise which is equivalent to the raw and 
filtered solar data should show a power of 6491 W/m 2 and 5644 W/m 2 
respectively. 

Noting that for the case of white noise, 

N 



^2\a n (U)\ 2 = N (a 2 n ) , (K.2) 



8=1 



where ./V is large and (a 2 ) is the expectation value of (a n ) 2 . From equa- 



tion (9.15), the definition of a Gaussian distribution, and the definition of 
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expectation value we have that 

1 f°° 
(aft) = = / x 2 e 

= oi. 

Consequently 




(K.5) 



Thus ol is equal to 0.0518 W/m 2 for the case of raw solar photometric noise 
and 0.0483 W/m 2 for the case of filtered solar photometric noise. Noting 
that Lq is equal to 1367.3 W/m 2 , this give a relative photometric variability 
of 3.79 x 10~ 5 and 3.53 x 10~ 5 respectively. 



y L dx, 



(K.3) 
(K.4) 



328 



Appendix L 



Proof that £q an d Tp tend to 

to and T p as TV — » oo 



To begin, consider a sequence of Tj values described by 



Tj = t + jT p + Acos(ujj + </>) + ej, (L.l) 



where each of the €j are normally distributed, independent, random variables 



with mean zero and standard deviation a e . As discussed in chapter 10 
the process of determining if the sinusoidal perturbation in the recorded Tj 
values is detectable, involves least-squares fitting a linear model (to + jT p ) 
and a linear model plus a sinusoid (to + jT p + A cos(i2>j + </>)) to these values, 
and comparing the residuals. While we expect the fitting parameters of the 
line plus a sinusoid model to tend towards the true values as N tends to 
infinity, it is less obvious what to and T p tend to as N becomes large. To 
investigate this, expressions for to and T p will be derived in this appendix. 
Then, through a consideration of the error on these quantities and their 
limit as iV tends to infinity, it will be shown that for the case where there 
is a detectable sinusoidal perturbation, to and T p tend to to an d T p as N 
becomes large. 
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L.l Expressions for to and T p 

From [Press et ah] ( |1992| p. 656), we have that the coefficients for a least- 
squares linear fit are given by 



E,=iJ 



(L.2) 
(L.3) 



where we note that there is no dependance on a e as it is the same for all 
Tj values. Substituting equation (L.l) into equations (L.2) and (L.3) and 
separating the ej terms from the other terms gives 



Ef=i f EU (*0 + JT P + A cos(uj + 0)) 



to 



}2j=i3 



Efci 3 EfLi j(to + jTp + A cos(ujj + 0)) 



N -2 



+ 



E j= x3 

£jli(E*Li * 2 - i Eili «)ej 



iVEf=iJ 2 



E i= iJ 



(L.4) 



^ Ef=i i(*o + jT p + A cos^j + 0)) 



P 



^E;=ii 2 -(Ef =1 i 



Efci J E*Li(*o + jT p + Acos(ujj + 4>)) 



ivEL^-(EL/ 2 



■0 



^Ef=ii 2 - Ef=ii 



(L.5) 



The first two terms in each of these expressions do not depend on any of 
the ej, and consequently represent the fit that would occur if there were 
no noise on the r,- values. The third term in each of these two expressions 
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consists of a weighted sum of e,- values and determines the degree to which 
the fit is perturbed by the presence of timing noise. We will consider these 
two aspects in turn for the case of large N, and combine them to give a full 
description of the behaviour of to and T p as N increases. 

L.2 Limiting behaviour of to and T p as N — > oo for 
the case where a e = 

For the case where there is no timing noise, i.e., a e = 0, the third term of 



equations (L.4) and (L.5) vanishes, and the equations become 



_ = Ef = ij 2 Ef = i(to + ir p + Acos(^ + ^)) 



T,f=ijT,f=ij(to+ jT p + Acos{uj + tp)) 

— — 2 ' ( L ' 6 ) 



_ N Zj=i 3 (tp + jT p + A cosjujj + 0)) 



T,f=i 3 Y,jLi (to + jT p + A cos(ujj + 4>)) 



iVEf=iJ 2 



(L.7) 



We begin by co nsideri ng equation (|L.6|). 
From section 



9.3.2 



N(N + l)/2 and that T,f=if 



and equations (9.25 ) and ( |9.26 ), we have that Ylj=\ 3 

N(N + 1)(2N + l)/6. However the be- 

and YljLiJ cos(cjj + terms is less ob- 
vious. As a result we will derive expressions for these terms. Consider the 



haviour of the Ylj=i cos(u;j + 



YljLi cos(o;j + (j)) term first. Writing the sinusoid as a complex exponential 
we have 



A' 



^2 cos(ujj + 



N 

^ 2 

N i 



e i(uij+<f>) , e ~i(ujj+t 



A 



-i(j> 



(e- iuJ Y 



(LI 



(L.9) 
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Both of these terms are geometric series, so they can be analytically evalu- 
ated. Recalling that 

A 1 - r N+1 

Var n = a— , 

^-^ 1 — r 



n=0 



and thus that 



we have that 

N 

cos(uj + 4>) = 

3=1 



N 



El _ r N+l 
n 11 
ar = a a, 
1 - r 

n=l 



f<t> \ — e «w(JV+l) gi</> 



(L.10) 
(L.11) 



i - e 



+ 



e - i< P l — e -iw(JV+l) e - 



1 -e" 



(L.12) 



which simplifies to 

AT 

^ cos(a;j + 4>) = 

3=1 

cos — cos(u;(iV + 1) 



cos(w — (/)) + cos(ujN + 4>) 



-cose/). (L.13) 

2-2cosw v ' 

So unless oj = k x 2tt, where k is an integer, (corresponding to the case 
where the moon completes an integer number of orbits per planetary orbit 
and is thus undetectable), this term is of order 1. 

The equivalent equation for the case of 5^=i 3 cos(cuj + (f>) can be con- 
structed from the one derived for YljLi cos(ujj + 4>), by noting that 

d N d N 

^ cos(wj + ((f) - tt/2)) = — ^ sin(wj + <j)), 



did 



3=1 



3=1 



A? 



Thus 

N 



J2jcos(ujj + <(>) 

3=1 



d 

doj 



J2jcO S (ujj + (f)). 
3=1 



cos(> - ir/2) - cos(uj(N + 1) + 4> - tt/2) 



(L.14) 
(L.15) 



2 — 2 cos ui 
- cos(a; - <j) + 7r/2) + cos(u;iV + <p - tt/2) 



2 — 2 cos uj 



cos{(j) — tt/2). 



, (L.16) 
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which evaluates to 

N 



-(N + l) cos(uj(N + 1) + <f>) + cos(w 



+ 



Ncos(loN + (, 
2 — 2 cos u 



2 sinw 



+ 



2 — 2 cos co 

sin — sm(u(N + 1) + 

(2 - 2 cos w) 2 
sin(w — 0) + sin(u;iV + <; 



(2 — 2 cos oj) 



(L.17) 



Consequently YljLi 3 cos(cjj + 0) is of order TV. 



Now that we have expressions for Ylj=i3> Y^j=x3 ■> ^2j=i cos ( w i + 4>) 
and S^Lx j cos(coj + 0) we can evaluate equation (L.6). Substituting these 
expressions gives 

N(N+l)(2N+l) 



to 



6 



Nt + 



o(i; 



A/ - AT(JV+l)(2Ar+l) _ ( N(N+1) 
iV 6 I 2 



w (^±^ + Mjmp±i) Tp + 0(iV ) 



iV 



iV(jV+l)(2jV+l) _ / JV(jV+l) 
6 V 2 



(L.18) 



where the sums involving sinusoid terms have been left in order of magnitude 
notation for simplicity. Canceling like terms and retaining only the highest 
order terms in TV" in the numerator and denominator, gives 



lim to 

N->oc 



N 



~3~ l ^0 

AT 3 _ { N(N+1)\' 



V 



(L.19) 
(L.20) 



Similarly, we can now also evaluate equation (L.7) describing T p . 



jy N(N+l)(2N+l) _ ^ N(N+1) ^ 
N(N+1) 



Nt + 



N(N+1) 



T p + 0(1) 



M N(N+1)(2N+1) _ ( N(N+1) 
iV 6 ^2 



(L.21) 
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where again, the sums involving sinusoid terms have been left in order of 
magnitude notation for simplicity. Canceling like terms and retaining only 
the highest order terms in N in the numerator and denominator, gives 



N*rp 

lim T — — 



4 1 P 

N± ' 
4 



(L.22) 
(L.23) 



Consequently, for the case where a sinusoidal perturbation would be de- 
tectable, that is u) 7^ 2kn where k is an integer, and the standard deviation 
a e is zero, to converges to to and T p converges to T p as N tends to infinity. 

L.3 Limiting behaviour of the error in to and T p as 

N oo 



From equations (L.4) and (L.5) we have that et and ex p , the error in to and 
T p due to timing noise is given by 



£&i(E£ii 3 -j££i*i 



and 



iVEf=ii 2 -(Ef=ii 



2 ' 



neUj 2 - EUi 



(L.24) 



(L.25) 



Using equation (9.17) from section 9.3, these expressions can be transformed 



to give the standard deviations at and or , of et and 6t p , 



cr to 



Eli(E£i< 2 -il£:iO a ^ 



and 



°"T 



(L.26) 



(L.27) 



where we recall that a e is the standard deviation of €j. 
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Consider equation (L.26), the equation for cr to . We have that 



£f=i(£*=i* 2 -j££iW 



a t 



0f o = a, 



0*0 = a e 



/v^JV / Af(AT+l)(2AT+l) . JV(JV+1) V 

V ^i=i \ v J 2 J 



Af 3 (jV+l) 2 (2jV+l) 2 _ Af3(Af+l) 3 (2Af+l) Ar3(AT+l)3(2Ar+l) 



36 



12 



24 



Af 2 (jV+l)(2Af+l) Af 2 (Ar+l) 2 



Consequently, 



lim ot = cr £ 

A 1 "— >oo 



TV 7 N 7 , N 7 



+ 



12 



jV 4 A 1 " 4 
3 4 



0" f 



0. 



12 

2 



A/ 



Similarly, for the case of ot v we have that 
_ y/Ef.iW-E£ii)y 

TT ' = w^L, j >-( E }L I i) > ' 

/ N 3 (N+1)(2N+1) A f3 (jV+l) 2 (2A r +l) A r3 (jV+l) 2 (2A f +l) 2 
V 6 6 + 36 

_ °" e AT 2 (AT+l)(2jV+l) Af 2 (Af+l) 2 

6 4 
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and thus, 



' N 5 N e | N 7 

^<PT P = CTe V V _V 9 ■ (L-38) 
3 4 

= <V^-> (L.39) 

12 

-,.^=, (L.40) 

= 0. (L.41) 

Consequently, the errors in to and T p tend to as N tends to infinity. 



L.4 Summary of behaviour of to and T p as — > oo 

The behaviour of the fitting parameters to and T p as tended to infinity 
was considered using a two- pronged approach. First, the behaviour of to 
and T p for the case where a e = was investigated. For this case it was 
found that to an d T p tended to to an d T p . Then the errors on to an d T p 
were considered for the case of non-zero a £ , and it was found that the size of 
these errors approach as N tends to infinity. Thus, as the average values 



of to and T p (equations (L.6) and (L.7)) do not depend on e,-, this implies 



that to and T p tend to to and T p as N tends to infinity. 
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Appendix M 

Method for transforming 
fitting parameters derived 
from a general r model to a 
specific r model 



In chapter 10 , a number of Monte Carlo simulations are done for the case 
of small N in order to derive detection thresholds. Practically this involves 
simulating a large number of realisations of r, performing a linear and a non- 
linear least-squares fit to each of them and determining the difference in the 
sum of the squares of the residuals of the two fits. As a result of the sheer 
number of fits required to make one plot, 1 and the artificially large value 
of the condition number of the covariance matrix (and its inverse) when 
realistic values for the coefficients are used, 2 it would be useful to perform 
these fits on more general models, and then transform the derived fitting 
parameters into those corresponding to the models of interest. In particular, 



10.5 


10.6 


10.7 


and 


10.8 



different values of semi major axis and 25 different values of A/a E . In addition 51 models 
were used to estimate the percentage of systems which would be detectable at the level of 
99.7% for each point. This would correspond to running 300 x 25 x 51 = 382500 models 
for each of the these 48 plots as opposed to the 25 x 25 x 51 = 31875 models run in total 
using this method. 

2 As T p ^> A, some columns of the design matrix are many orders of magnitude larger 
than others. This makes it numerically difficult to perform the non-linear least squares 
fitting. In particular, for the case of the MATLAB function nlinfit, the computation 
time for a model with T p 3> A, compared to a more general model with T v 
approximately factor of 10 larger as a result of the additional function calls. 
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using such a method, would mean that fewer models are run as one general 
model can describe many simpler models, and each model takes a shorter 
time to fit as a result of the smaller condition number. Such a method 
exists. In particular, it can be used to transform the fitting parameters for 
model with any value of to an d T p to that for any other value of to an d T p , 
given that A/a e , u and (f) are the same. Then, using these expressions, the 
difference in the sum of the residuals squared for the two models can be 
easily calculated. 

To begin, consider a sequence of r values described by 



Tj = t + T p j + A cos(w? + (f>) + e 



j ' 



(M.l) 



where e,- is a normally distributed variable with mean and standard de- 
viation a e . Performing the least-squares linear (no moon) and least-squares 



non-linear (moon) fits detailed in section 10.3.2.3 gives the fitting parame 



ters to arid T p , and to, T p , A, uj and <fi respectively. In particular, recalling 
that at a minimum the derivative equals zero, these parameters are defined 
by 







N 



(M.2) 



and 



N 



= (*0 + T pJ + A cos (^ + <f>) + ej 



3=1 



-(i + f p j + Acos(uj + </>))) , (M.3) 



where x could be either one of to or T p and where x could be any of to, T p , 
A, uj or (j). As will be shown in this appendix, these model parameters can 
be directly derived from a more general model given by 



^4 

Tj = a + bj -\ cos(uj + 4>) + 

0V 



Or 



(M.4) 



where we note that the e,- in equation (M.l) and the ej in this equation are 
the same. By analogy, fitting parameters for this model can also be defined 



by equations similar to equations (M.2) and (M.3). In particular 



N 



^2 ( a + + ^ cos(ujj + (/))+ £ '' 



dx 



3=1 



(o + bj) 



(M.5) 
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and 



7=1 



cos(wj + 0) + 



-(a + 6j + — cos(wj + ^)) ) , (M.6) 



where again we note that x could be either one of a or b and where x could 
be any of a, b, A, lu and <j>. 



By transforming equation (M.5) into an equation equivalent to equa- 



tion (M.2), and similarly, transforming equation (M.6) into an equation 



equivalent to equation (M.3), we will derive equations for transforming the 



fitting coefficients. In addition it will also be shown that the difference of 
the sums of residuals squared can also be easily transformed. 



M.l Expressions for to an d T p in terms of a and b 

Consider equation ( |M.5[ ), 



dx 



( a + bj + — cos(wj + </>) + - (a + bj) J 

j=l \ a e <?e J 



Multiplying this equation by a t gives 







N 



= Qx ^ ( a(Te + b(Tej + A C ° S ^ j + ^ + €j ~ ^ + b(7ej ^ ' ^ M ' 7 ^ 



Adding and taking away to + jT p to each term gives 



8 



N 



-([to + (o - a)<x e ] + [T p + (6 - 6)a e ]i)) 2 . (M. 



This equation is equivalent to equation (M.2) in that if to and T p are 
the solutions to equation (M.2), then they must also be the solutions to this 
equation. In other words, 



*o = *o + (a - a )°"e, 
T p = T p + (6 - b)a e . 



(M.9) 
(M.10) 
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These expressions allow the fitting coefficients defined for the general model 
to be transformed to a particular model. We now consider the case of the 
non- linear fit. 



M.2 Expressions for to and T p in terms of a and b 



Again we begin by considering equation (M.3), 

A' 



d 



3=1 



A 



= ft ( a + bj + — cos (^' + ^) + 



A 

-(a + bj H cos (or? + 0)) 



Multiplying through by cr e gives 



9 



= — ^2 ( acr e + ^^ei + A cos(u)j + (/>) + ej 



dx 

3=1 

— (aa e + 6<7 e j + A cos(cDj + 0)) 
Again, adding and taking away to + jT p to each term gives 

d N 



(M.ll) 



= (*o + J t p + ^ cos ( w i + 0) + e J 



j=i 



-([to + (a - a)(r e ] + [T p + (6 - &)<r £ ]j + icos(a;j + 0)) (M.12) 



This equation is again equivalent to equation (M.3) in that if io 5 T p , A, uj and 
are the solutions to equation (M.3), then they must also be the solutions 
to this equation. In particular, to should be equal to the first term in square 
brackets, T p should be equal to the second term in square brackets, and A, 
Cj and (j) should be the same in both cases. Mathematically this means that 



to = to + (a - a)a e , 
T p = T p + (b- b)a t . 



(M.13) 
(M.14) 



Now we will consider the transformation of the difference of the sum of the 
residuals squared, 21og(A). 
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M.3 Expression for the difference of the sum of 
residuals squared in terms of the difference of 
the sum of residuals squared for the general 
model 

Now that we have expressions for to, T p to and T p in terms of a,b a, b and 
a e we can show that the difference in the sum of the residuals squared for 
the case of the general model can be transformed to that for a physically 
realistic model. In other words, given a set of these differences calculated for 
set of general models, the set of differences which would have been observed 
for the equivalent (the same values of A/a 6 , u and (f>) realistic models can 
be calculated. 

To begin, consider the difference of the sum of squares of the residuals 
for the two models, for the realistic model 

N 

2of log A real = ^2 (*o + T p j + Acos(wj + 4>) + ej - (t + T p j)) 

N 

~ XT C*o + Tpj + A cos(u;j + 4>) + ej 

3=1 

-{k + T p j + icos(wj + <£))) , (M.15) 



and the general model 



N 



3=1 



A 



2 log A gen = V ]ia + bj + — cos(wj + <j)) + — - (a + bj) 



N 



^2(a + bj + — cos(u;j + 4>) + 



Or 



a + bj + — cos(wj + 4>) j j . (M.16) 



Using equations (|M.9l), (|M.10|), (|M.13l) and (IM.14I) to substitute for a, 6, a 
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and b, equation (M.15) becomes 



N 

2of log A rea i = (t Q + T p j + A COs(ujj + </>) + 6j 
J'=l 

-(t + (a - a)a € + (T p + (6 - 6)^)j)) 2 

AT 

-(t + (a - a)cr e + (T p + (b - b)a e )j + Acos(ooj + <f>)) 

which simplifies to 

N 



21ogA rea/ = ^2[a + bj + 



— cos(wj + (j>) + — - (a + £>j) 



A? 



°f ^ I a + 67 H cos(wj + </>) + 

A 



Q + bj H cos(wj + <f>) 

&6 



(M.17) 



(M.18) 



Comparing equations (M.18) and (M.16), it can be seen that 



21ogA reaZ = 2 log A 



gen- 



(M.19) 



Consequently, a set of general models can be simulated, and have their 
2 log A values calculated and recorded. Then, these values can be directly 
applied to any equivalent realistic model. 
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Appendix N 

68.3% and 
thresholds 



95.4% TTV 



In figures N.l to N.4 and N.5 to N.8 we show the 68.3% and 95.4% moon 



detection thresholds equivalent to the 99.7% thresholds discussed in chap- 
ter 10 and shown in figures 10.5 to 10.8 For reference, using the Monte 



Carlo method detailed in section |10.6.1| it was found that the 68.3% and 
95.4% limits on 2 log A were 3.96 ± 0.04 and 8.63 ± 0.09 for the case where 
N = 9 (0.6AU), 4.22 ± 0.04 and 8.99 ± 0.07 for the case where N = U 
(0.4AU) and 4.49 ±0.04 and 9.26 ±0.11 for the case where N = 40 (0.2AU). 
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-While 
R lie 




Miuin Scjn J— [DLtiiH" Axis 1 10 AU) 




Moon Semi-major Axis (10 AU) 



0.5 1 1.5 

Mm mi Seini-rmjor Axis ( 10 AUI 



(a) M p =WMj, a p = 0.2AU. (b) M p =10Mj, a p = 0.4AU. (c) M p =WMj, a p = 0.6AU. 




Moon Semi-major Axis ( 10 AU) 



Ninon Semi-major Axis ( 10 ' AU) 



Ninon Sl-j;i j — n jll j in Axis ( 10 AU) 



(d) M p = Mj, a p = 0.2AU. (e) M p = Mj, a p = 0.4AU. (f) M p = Mj, a p = 0.6AU. 




mil 



-While 

Hkcred 
-Solar 



Moon Semi-major Axis { 10 AU) 



Moon Semi-major Axis (10 2 AU) 




Moon Semi-major Axis (10 AU) 



(g) M p = Mu,a p = 0.2AU. (h) M p = M v , a p = 0.4AU. (i) M p = Mu,a p = 0.6AU. 



-White 
Filtered 




Mmin Semi-major Axis 1 10 AU) 




Nloon Semi-major Asis ( 10 " AU) 



Moon Semi-major Axis (10 AU) 



(j) M p = M , a p = 0.2AU. (k) Mp = M , a p = 0.4AU. (1) M p = M®, a p = 0.6AU. 



Figure N.l: Figure of the same form as figure 10.5 but showing the 68.3% 
thresholds. 
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Moon Semi— major Axis (10 AU) 



(a) M p =10Mj, a p = 0.2AU. (b) M p =10Mj, a p = 0.4AU. (c) M p =WMj, a v = 0.6AU. 
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Moon Semi— major Axis (10 AU) 



(d) M p = Mj, a p = 0.2AU. (e) M p = Mj, a p = 0.4AU. (f) M p = Mj, a p = 0.6AU. 




Moon Semi-major Axis ( 10 AU) 



Moon Semi-major Axis (10 AU) 



Moon Semi— major Axis (10 AU) 
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Figure N.2: Figure of the same form as figure 10.6, but showing the 68.3% 
thresholds. 



345 



Ilk 



-While 
R lie 





Moon Semi-major Axis { 10 AU) 



Moon Semi-major Axis (10 2 AU) 



Moon Semi-major Axis (10 AU) 



(a) M p =WMj, a p = 0.2AU. (b) M p =10Mj, a p = 0.4AU. (c) M p =WMj, a p = 0.6AU. 




Moon Semi-major Axis { 10 AU) 



Moon Semi-major Axis ( 10 " AU) 



Moon Semi-major Axis (10 AU) 



(d) M p = Mj, a p = 0.2AU. (e) M p = Mj, a p = 0.4AU. (f) M p = Mj, a p = 0.6AU. 




Mi Hin Sciiii-nuiiir Axis [10 AU) 



Moon Sl'iiii- ii5.i j< )[■ Axis ( 10 AU| 



0.5 1 1.5 

Moon Semi-major Axis (10 AU) 



( g ) M p = M v , o p = 0.2AU. (h) M p = M v , a p = 0.4AU. (i) M p = M v , a p = 0.6AU. 



White 

Filtered 



Moon Semi-major Axis ( 10 -1 AU) 




Moon Semi-major Axis (10 AU) 



Moon Semi-major Axis (10 AU) 



(j) M p = M , a p = 0.2AU. (k) Mp = M , a p = 0.4AU. (1) M p = M®, a p = 0.6AU. 



Figure N.3: Figure of the same form as figure 10.7 but showing the 68.3% 
thresholds. 
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Figure N.5: Figure of the same form as figure 10.5 but showing the 95.4% 
thresholds. 



348 




■ i 

II 






— While 1 
Filtered 
r — Solur 


JjLu.l 


uUll 



m 


...Hit 


liy While 

yl Filtered 
j Solar 


y 



Moon Semi-major Axis (10 AU) 



Moon Semi-major Axis (10 AU) 



Moon Semi— major Axis (10 AU) 



(a) M p =10Mj, a v = 0.2AU. (b) M p =10Mj, a p = 0.4AU. (c) M p =WMj, a v = 0.6AU. 




Mi H i[i St'ini-ni.iiiir Axis 1 10 AU) 



Moon Semi-major Axis (10 ~ AU) 



Moon Semi— major Axis (10 AU) 



(d) M p = M.,, a p = 0.2AU. (e) M p = Mj, a p = 0.4AU. (f) M p = Mj, a p = 0.6AU. 



-White 
Filtered 




Moon Semi-major Axis (10 AU) 




Moon Semi- major Axis (10 AU) 



Moon Semi— major Axis (10 AU) 



( g ) M p = Mu, a p = 0.2AU. (h) M p = M v , a p = 0.4AU. (i) M p = M v , a p = 0.6AU. 





— White 

Filtered 
Solar 



Moon Semi-major Axis (10 AU) 



Moon Semi-major Axis (10 ~ AU) 



Moon Semi— major Axis (10 AU) 



(j) Mp = M , a p = 0.2AU. (k) M p = Mm, a p = 0.4AU. (1) M p = M , a p = 0.6AU. 



Figure N.6: Figure of the same form as figure 10.6, but showing the 95.4% 
thresholds. 
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Figure N.7: Figure of the same form as figure 10.7 but showing the 95.4% 
thresholds. 
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